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POLYPEPTIDES OF 6- COUPLED RECEPTOR PROTEINS, 
AND COMPOSITIONS AND METHODS THEREOF 

FIELD OF THE INVENTION 
The present invention relates to compounds, 
5 compositions and methods involving synthetic, isolated and/or 
recombinant G- protein coupled receptor polypeptides that 
comprise fragments and/or consensus peptides of G-protein 
coupled receptors. 

BACKGROUND QF THE INVENTION 

10 The membrane protein gene superfamily of G-protein 

coupled receptors (GPRs) has been characterized as having seven 
putative transmembrane domains. The domains are believed to 
represent transmembrane a-helices connected by extracellular or 
cytoplasmic loops. Of the 74 sequenced members of this 

15 G-protein receptor superfamily, the shortest sequence of 324 

amino acids represents the rat mas oncogene and the longest, of 
744 amino acids, represents the human thyroid- stimulating 
hormone (TSH) receptor. GPRs thus include a wide range of 
biologically active receptors, such as hormone-, viral-, growth 

20 factor- and neuroreceptors. 

G-protein coupled receptors have been characterized as 
including these seven conserved hydrophobic stretches of about 
20-30 amino acids, connecting at least 8 divergent hydrophilic 
loops. The G-protein family of coupled receptors includes 

25 dopamine receptors which bind in a noncovalent but high affinity 
manner to neuroleptic drugs used for treating psychotic and 
neurological disorders. For example, the dopamine D 2 receptor 
includes these transmembrane domains, two of which (TM III and 
TM V; see below) have been implicated by site- selective 

30 mutagenesis to demonstrate functional, association with D, 
ligands . 

Transmembrane domains of G-protein coupled receptors 
are designated TM1, TM2, TM3, TM4, TM5, TM6 and TM7. TM4 , TM5 , 
TM6 and TM7 are the most highly conserved and are postulated to 
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provide sequences which impart biological activity to GPRs. 
Most GPRs have single conserved cysteine residues in each of the 
first two extracellular loops which form disulfide bonds that 
are believed to stabilize functional protein structure. TM3 is 
5 also implicated in signal transduction. 

Phosphorylation and lipidation (palmitylation or 
farnesylation) of cysteine residues can influence signal 
transduction of some GPRs. Most GPRs contain potential 
phosphorylation sites (e.g., serine or theronine residues) 

10 within the third cytoplasmic loop and/or the carboxy terminus. 

For several GPRs, such as the 0-adrenoreceptor, phosphorylation 
by protein kinase A and/or specific receptor kinases mediates 
receptor desensitization. 

Non-limiting examples of GPRs include cAMP receptors, 

15 adenosine receptors, ^-adrenergic receptors, muscarinic 

acetylcholine receptors, or-adrenergic receptors, serotonin 
receptors (5-HT), histamine H2 receptors, thrombin receptors, 
kinin receptors, follicle stimulating hormone receptors, opsins 
and rhodopsins, odorant receptors, cytomegalovirus receptor, 

20 etc. See e.g., Probst et al DNA and Cell Biology 11:1-20(1992), 
which is entirely incorporated herein by reference. 

The ligand binding sites of GPRs are believed to 
comprise a hydrophilic socket formed by several GPR 
transmembrane domains, which socket is surrounded by hydrophobic 

25 residues of the GPRs. The hydrophilic side of each GPR 

transmembrane helix is postulated to face inward and form the 
polar ligand binding site. TM3 has been implicated in several 
GPRs as having a ligand binding site, such as including the TM3 
aspartate residue. Additionally, TM5 serines, a TM6 asparagine 

30 and TM6 or TM7 phenylalanines or tyrosines are also implicated 
in ligand binding. 

GPRs can be intracellularly coupled by heterotrimeric 
G-proteins to various intracellular enzymes, ion channels and 
transporters. See, e.g., Johnson et al Endoc. Rev. 

35 10:317-331(1989) ; and Birnbaumer et al Biochem. Biophys . Acta 

1031:163-224(1990) which references are incorporated entirely 
herein by reference. GPR agonist binding catalyzes the exchange 
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of GTP for GDP on the a-subunit of the G-protein. Different 
G-protein a-subunits preferentially stimulate particular 
effectors to modulate various biological functions in a cell. 
Phosphorylation of cytoplasmic residues of GPRs has been 
5 identified as an important mechanism for the regulation of 

G-protein coupling of some GPRs. 

As a non- limiting example of a GPR ligand, dopamine 
(3,4-dihydroxyphenethylamine) is a critical neurotransmitter in 
the central nervous system (e.g., in the substantia nigra, 

10 midbrain, and hypothalamus) . Since the elucidation of the 

ascending mesolimbic and nigrostriatal pathways, these pathways 
have been found to be critical in the control of both motor 
initiation (nigrostriatal) behavior and affective (mesolimbic) 
behavior. The clinical efficacy of the major neuroleptic 

15 antipsychotic medications has been found to correlate with the 
respective affinities of these agents for the dopamine D 2 
receptor in the brain. A dopaminergic role in the 
symptomatology of the major psychoses has thus been 
hypothesized, although it is unclear if dopamine alone is 

20 etiological, (see, e.g., Davis et al. Am. J. Psych. 148:1474- 
1476 (1991)). Nonetheless, this hypothesis has served as a 
stimulus for current research in this area. 

One model for studying possible interactions of 
G-protein coupled receptors with their ligands has emerged from 

25 site-directed mutagenesis and biochemical analysis of the 

^-adrenergic receptor, as well as from biophysical analysis of 
the interaction of retinal with opsin. 

According to such a model, the binding of a GPR ligand 
to a G-protein coupled receptor involves multiple interactions 

3 0 between functional groups on the GPR ligand and residues within 
the hydrophophilic binding site of the receptor. 

While a number of the amino acid residues in the 
dopamine D 2 receptor have been postulated to participate in D : 
ligand binding, based on results obtained from site- directed 

3 5 mutagenesis studies and photoaf f inity labeling studies performed 
on the 0- adrenergic receptor, such studies have failed to 
specifically determine which residues are actually involved in 



WO 94/05695 



PCT/US93/08528 



binding in the D 2 system. Sibley et al . Sec. Neurosci. Abs. 
17:36.10, 324.5, 324.6 (1991). 

The clinical use of neuroleptics has provided a means 
for treating patients suffering from psychotic disorders. 
Short-term use of neuroleptics is indicated in several types of 
psychotic disorders, e.g., acute psychotic episodes, regardless 
of type; exacerbations of schizophrenia; acute manic excitement 
while deferring use of lithium or awaiting onset of its effects; 
adjunctive therapy for major depression with prominent psychotic 
symptoms, or when an antidepressant or ECT alone is not 
successful; for agitation in delirium, dementia, or severe 
mental retardation while seeking to identify and treat the 
primary basis of the problem; in certain chronic, degenerative, 
or idiopathic neuropsychiatry disorders with dyskinesias, such 
as Huntington's disease or Gilles de la Tourette ' s syndrome; or 
for ballism or hemiballism; childhood psychoses or apparently 
allied conditions marked by severe agitation or aggressive 
behavior; miscellaneous medical indications, notably nausea and 
vomiting, or intractable hiccups. 

Additionally, continuous long-term use of neuroleptics 
is indicated in many psychotic disorders, such f.s (for more than 
six months) (i) primary indications such as Schizophrenia, 
Paranoia 1,1 , Childhood psychoses, some degenerative or idiopathic 
neuropsychiatric disorders (notably, Huntington's disease and 
Gilles de la Tourette' s syndrome); (ii) secondary indications 
such as extremely unstable manic-depressive or other episodic 
psychoses (unusual) , otherwise unmanageable behavior symptoms in 
dementia, amentia, or other brain syndromes; and (iii) 
questionable indications such as chronic characterological 
disorders with schizoid, "borderline, " or neurotic 
characteristics; substance abuse; or antisocial behavior, 
recurrent mood disorders. See, e.g., Baldessarini , Chemotherapy 
in Psychiatry, Revised and Enlarged Edition, Harvard University 
Press, Cambridge, MA, (19B5) , the contents of which is entirely 
incorporated herein by reference. 

Neuroleptics are also referred to as neuroplegics, 
psychoplegics, psycholeptics , antipsychotics and major 
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tranquilizers, but are sometimes distinguished from non- 
neuroleptic anti-psychotics. Neuroleptics have recently been 
characterized as an agent that produces sedative or 
tranquilizing effects, and which also produces motor side 
5 effects, such as catalepsy or extrapyramidal symptomatology. 
Nonlimiting representative examples of neuroleptics include 
phenothiazine derivatives (e.g., chlorpromazine) ; thioxanthine 
derivatives (e.g., thiothixene); butyrophenone derivatives 
(e.g., haloperidol) ; dihydroindolone (e.g., molindone); 
10 dibenzoxazepine derivatives (e.g., loxapine) ; and "atypical" 
neuroleptics (e.g., sulpiride, remoxipiride pimozide and 
clozapine) . See Berstein Clinical Pharmacology Littleton, 
Mass.:PSG Publishing (1978); TJsdin et al Clinical Pharmacology 
in Psycnia try New York: Elsevier North- Holland (1981); and 
15 Baldessarini, supra, (19B5) ; and , which references are herein 
entirely incorporated by reference. 

The term "atypical neuroleptics" has been used to 
describe antipsychotic neuroleptics that produce few or no 
extrapyramidal side effects and which do not cause catalepsy in 
20 animals (See, e.g., Picket et al, Arch. Gen. Psychiatry 49 : 345 
(May 1992). Alternatively, atypical neuroleptics, such as 
clozapine, have been described as those neuroleptics which have 
a higher affinity for D 4 and D, sites than for D 2 sites (See, 
e.g., Davis et al Amer. J. Psych. 148:1474, 1476 (November 
25 1991) . 

The long term use of all known anti-psychotics, such 
as neuroleptics or non-neuroleptic antipsychotics, has resulted 
in serious side effects, as present in Table I, such as 
persistent and poorly reversible motoric dysfunctions (e.g., 

30 tardive dyskinesia) in a significant number of patients. These 
side effects are especially prevalent in geriatric populations, 
and adequate pharmacological treatment of these debilitating 
motoric dysfunctions is not currently available. This problem 
has severely limited the long-term, clinical administration of 

35 these agents. 
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TABLE I 

Neurological Side Effects of 
Neuroleptic -Antipsychotic Drugs 



Proposed mechanism Treatment 



Acute dystonia Spasm of muscles o1 1-5 days Dopamine excess? Antiparkinsoniom agants ai 
tongue, face, neck, Acetylcholine excess? diagnostic and curative 

back: may mimic (i.m. or i.v., then p.o.) 

seizures; not 



Parkinsonism Bradykinesia, rigidity. 5-30 days Dopamine blockade Antiparkinsonian! agents 
variable tremor, mask- (rarely <P-°1; dopamine agonists 

facies, shuffling gait persists) risky? 

Akathieia Motor restlessness; 5-60 days Unknown Reduce dose or ehangB drug 

patient may experience (commonly low closes of propranolol;" 

anxiety or agitation persists) antiparkinsonian! agents or 

or benzodiazepines may help 

Dopamine excess? Prevention best; treatment 

unsatisfactory) slow spontaneous 
:s irreversible, withdrawal) remission 



Perioral tremor (late Months of Unknown 
parkinsonism variant?); yeers 
usually reversible 

Catatonia, stupor. Weeks Unknown Stop neuroleptic; antiparkinsonian! 

fever, unstable pulse agents usually fall; bromocriptine 

and blood pressure; often helps; dantrolene variable; 

myoglobinemia; can general supportive oare oruoiel 



a. There mey be an increased risk of hypotension on interacting high doses of propranolol with some antipsychotic 
agents; donidine may also be effective at doses of 0,2-0,8 mg/day, but carnse a high tiak of hypotension (Zubenko et el., 
Psychiatry Res. 11:143, 1984). 

In addition, clozapine, although apparently capable of 
producing less motor side effects, can cause irreversible, 
potentially fatal agranulocytosis in a minority of patients 
administered the drug. Such serious side effects limit the use 
of 
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clozapine to patients who are resistant to treatment with other 
neuroleptics . 

Antipsychotics have a variety of significant 
pharmacological effects, e.g., as presented in the following 
Tables II and III. 

Table II 

Comparative Pharmacology of Neuroleptics 



Alkaloid 


Phenothiazlne 
Derivative 

Chlorpromazine 


Thloxanthene 
Derivative 

Thiothixene 


Butyrophenone 
Derivative 

HaloDeridol 


Antipsychotic 


Yes * + 


Yes + + 


Yes + + + + 


Antiemetic 


Yes + + + 


Not tested 


Yes + * + 


Hypothermia 




Yes + 


No 


Hypotension 


Yes + + 






Parkinsonism 


Yes + + 




Yes + + + + 


Anti adrenergic 


Yes * * 






Anticholinergic 




Yes * 


Negligible 


Antihistamine 




Negligible 


Negligible 


Releases HE, DA 




No 


No 


Blocks DA 


Yes + + 




Yes + + + + 


Blocks NE 


Yes + + 


Yes + + + 


Yes + 


Central sympathetic 


Yes + + 


Yes + 


Yes + + + 



suppressant 

CHorpfonvmn., thialhixana, and haknancfc.1 daoraa.a m. availab i lity ot dopamine IDA) «nd noraplnaphrt na (NE) by blocking th. 

dsoanarM Hnaptor >fan b tha baaal a»ngll. and WKpwfhin. rocaptor «rtw in ihalanvo and h V po*»l"iTiic araaa. Raaaipina cimply r«du« 
concfHUnrtion at nsnpinaphrina and dopamine in thaoa ar«n. Both of than action, mult in aupprOMian ot eantral aympathelic activity. 



Table III 

Comparative Pharmacology of Antipsychotics 



Adrenergic 

Sedation Blockage Reaction, 



Chlorpromazine High Moderate to high Moderate 

Chlcrprothixene High High Low to moderate 

Haloperidol Low Low High 

Molindone Moderate Moderate Moderate to high 

Lcocapine High Low to moderate High 



See Ebadi, PHARMACOLOGY, Little, Brown and Co., Boston, 61-65 
(1985); Cattabeni et al Adv. fliocheai. Psychopharmacology 24 : 2 75 
(1980), Baldeasarini, supra, which references are herein 
incorporated entirely by reference. 

However, despite the fact that thousands of neuroleptic - 
or antipsychotic- type compounds have been synthesized and reported 
in the literature, such compounds which lack serious side effects 
and which have sufficient pharmacological activity, have not been 
disclosed. 
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Alternative to dopamine receptor GPRs, as presented 
above, other neuroreceptor GPRs are involved in neurological 
pathologies, and drugs such as neuroreceptor GPR binding agents, 
presently used for treating these pathologies, also suffer from 
5 similar side effects as those of neuroleptics, as presented above. 
Other GPRs are also involved in receptor- related 
pathologies, such as hormone related GPRs involved in endocrine 
related pathologies. 

Accordingly, there is a need to provide G- protein coupled 

10 receptor binding agents, including neuroreceptor and endocrine 
receptor GPRs, which do not produce such deleterious and 
debilitating side effects as those produced by known agents, such 
as neuroleptics, which can be used for therapy or diagnosis of GPR 
related pathologies . 

15 Citation of documents herein is not intended as an 

admission that any of the documents cited herein is pertinent 
prior art, or an admission that the cited documents are considered 
material to the patentabilty of the claims of the present 
application. All statements as to the date or representations as 

20 to the contents of these documents are based on the information 
available to the applicant and does not constitute any admission 
as to the correctness of the dates or contents of these documents. 

SUMMARY OF THE INVENTION 

It is therefore an object of the present invention to 
25 overcome one or more deficiencies found in the related art. 

It is another object of the present invention to provide 
non-naturally occurring synthetic, isolated and/o^ recombinant GPR 
polypeptides which are fragments, consensus fragments and/or 
sequences having conservative amino acid substitutions, of at 
30 least one transmembrane domain of at least one G-protein coupled 
receptor, which polypeptides have been discovered to have 
receptor- like functional binding sites of neuroreceptor and 
endocrine GPRs, such that GPR polypeptides of the present 
invention may bind GPR ligands, or which may also modulate, 
35 quantitatively or qualitatively, GPR ligand binding to GPRs. 
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It is still another object of the present invention to 
provide GPR polypeptides and compositions that have only partially 
helical structures, in contrast to known characterized 
transmembrane domains of GPRs, such as, but not limited to, GPR 
5 transmembrane domains I -VII. 

It is yet another object of the present invention to 
provide synthetic or recombinant GPR polypeptides, conservative 
substitution derivatives thereof, antibodies, anti-idiotype 
antibodies, compositions and methods that can be used as potential 

10 modulators of G-protein coupled receptor function, by binding to 
GPR ligands or modulate GPR ligand binding, due to their expected 
biological properties, which may be used in diagnostic, 
therapeutic and/or research applications. 

It is a further object of Che present invention is to 

15 provide synthetic, isolated or recombinant polypeptides which are 
designed to inhibit or mimic various GPRs or fragments thereof, as 
receptor types and subtypes. 

According to one aspect of the present invention, a 
synthetic or recombinant GPR polypeptide is provided that 

20 comprises a GPR amino acid sequence of, e.g., at least 5, 10, 15 
or 20 amino acids, substantially corresponding to at least one 
transmembrane domain, or fragment and/or consensus peptide 
thereof, of a G-protein coupled receptor, wherein at least 20 
amino acids are preferred. In a preferred embodiment, the 

25 polypeptide is (a) chemically synthesized and/or {b) obtained from 
a recombinant host cell or organism which expresses a recombinant 
nucleic acid encoding a GPR polypeptide, as defined herein. 

In another preferred embodiment, the transmembrane domain 
is selected from at least one of TMl, TM2, TM3 , TM4 , TM5, TM6 or 

3 0 TM7 , corresponding to transmembrane domains I, II, III, IV, V, VI 
and VII, respectively, of a GPR. In another preferred embodiment, 
the transmembrane domain is a dopamine receptor transmembrane 
domain selected from the group consisting of at least one of a D,, 
Dj. D 3I D 4 and D 5 dopamine receptor transmembrane domain. The 

35 transmembrane domain, e.g., may be selected from at least one of D : 
receptor transmembrane domains III or V. In still another 
preferred embodiment, the GPR polypeptide amino acid sequence 
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substantially corresponding to an amino acid sequence contained in 
at least one of Fig. 2 (SEQ ID N0:2) , Fig. 3 (SEQ ID N0:3) or Fig. 
5 (SEQ ID NO: 5) . 

In another aspect of the present invention, a GPR 
5 composition is provided, comprising a GPR polypeptide, or a 
pharmaceutical^ acceptable ester, ether, sulfate, carbonate, 
malate, glucuronide or salt thereof, the composition further 
comprising a pharmaceutically acceptable carrier and/or diluent, 
in still another aspect of the present invention, a 

10 method is provided for treating a subject suffering from a disease 
state involving a qualitative or quantitative pathological 
abnormality of a GPR protein or a biological molecule functionally 
associated therewith. Such biological molecule may be a membrane 
cytoplasmic protein, lipid, carbohydrate, saccharide, nucleoside 

15 or nucleotide mono-, di-, or tri -phosphate, an enzyme, a co- 
factor, a nucleic acid, a neurotransmitter, an ion, a carrier, a 
cell receptor, or any combination thereof. 

In a preferred embodiment, the GPR protein is a dopamine 
receptor and the abnormality involves a dopamine related 

20 pathology, wherein the method comprises administering an effective 
dopamine receptor modulating amount of a GPR polypeptide of the 
present invention. In another preferred embodiment, the 
transmembrane domain is a D 2 dopamine receptor domain and the 
disease state is a psychiatric disorder, such as schizophrenia or 

25 schiz affective disorder (see American Psychiatric Association, 
Revised Manual of Diagnostic and Statistical Criteria for 
Psychiatric Disorders (DSM-III-R) . American Psychiatric Abboc. 
press, Washington, DC (1939)). 

In another preferred embodiment, the GPR composition is 

3 0 administered as a pharmaceutical composition to provide a GPR 

polypeptide in an amount ranging from about 0.01 jxq to 100 mg/kg, 
and also preferably, about 10 ^g to 10 mg/kg. In another 
preferred embodiment, the administering is by oral, intravenous, 
intramuscular, parenteral or topical administration, including 

3 5 mucosal administration to the nasal mucosa or the oral mucosa, by 
aerosol, nebulizer or drop administration as non- limiting 
examples . 
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Other objects of the invention will be apparent to 
skilled practitioners from "he following detailed description and 
examples relating to the present invention. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is the amino acid sequence of a control peptide 
(SEQ ID NO:l), which is hydrophobic in its properties, but does 
not correspond to a known GPR transmembrane domain. 

Fig. 2 represents the amino acid sequence of a GPR 
transmembrane polypeptide, polypeptide II {SEQ ID NO:2) , which 
corresponds to a portion of the dopamine D, receptor transmembrane 
segment ill. 

Fig. 3 represents the amino acid sequence of a 
transmembrane polypeptide, polypeptide III (SEQ ID NO: 3), 
corresponding to a consensus peptide of the dopamine D 2 receptor 
transmembrane domains I -VI I. 

Fig. 4 represents the amino acid sequence of a consensus 
sequence of transmembrane domains that is shortened to be less 
than the length required to span a lipid bilayer. 

Fig. 5 represents a consensus amino acid sequence of 
transmembrane domain as a consensus peptide between dopamine 
receptors D, and D,. 

Fig. 6 is a representation of a circular dichroism 
spectrum of a solution of the consensus polypeptide III (SEQ ID 
NO: 3) of Fig. 3. 

Fig. 7 is a graphical representation of radioligand 
binding assay data comparing control polypeptide II {SEQ ID N0:1) 
of Fig. l, labeled as n II" and consensus polypeptide I (SEQ ID 
NO:3) of Fig. 3, labeled as "I". 

Fig. 8A-G are a comparison listing of amino acid 
sequences of transmembrane domains and adjacent amino acid 
sequences of representative GPRs (SEQ ID NOS:6-79). 
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DETAILED DESCRI PTION OF THE PREFERRED EMBODIMENTS 

The present invention relates to G-protein coupled 
receptor (GPR) polypeptides which can be used to mimic naturally 
occurring or isolated GPRs, or to modulate the binding of GPR 
5 ligands to GPRs, such as inhibition or enhancement of binding. 
GPR polypeptides of the present invention can include GPR 
transmembrane domain fragments and/or consensus peptides thereof, 
of at least 4-10 amino acids in length, and/or corresponding 
sequences having conservative amino acid substitutions as 

10 "substitution peptides", wherein the GPR polypeptide binds a GPR 
ligand or modulates the binding of a GPR ligand tD a GPR in vitro, 
in vivo or in situ. 

GPR polypeptides of the present invention can be 
synthesized or recombinantly produced, or optionally purified, to 

15 provide commercially useful amounts of GPR polypeptides for use in 
therapeutic, diagnostic or research applications, according to 
known method steps, see, e.g., Ausubel et al, eds_. Current 
Protocols in Molecular Binlnov. Wiley Interscience , N.Y., (1987, 
1992); and Sambrook et al, Molecular Cloning. A Laboratory Mam^T . 

20 2nd edition, Vols. 1-3, Cold Spring Harbor Press, (1989), which 
references are herein entirely incorporated by reference. 

Additionally, GPR polypeptides according to the present 
invention can be used to generate polyclonal and/ or monoclonal 
antibodies, anti- idiotype antibodies thereto, or fragments 

25 thereof, which may used for diagnostic and/or therapeutic 

applications, according to known method steps, see, e.g., Harlow 
and Lane, Antibodies: A Labor atory Manual . Cold Spring Harbor 
Press {1988) , which is herein entirely incorporated by reference. 

GPR polypeptides, anti-GPR antibodies or anti-idiotype 

30 antibodies (or fragments thereof) to GPR polypeptides have been 
unexpectedly discovered to quantitatively or qualitatively 
modulate G-protein coupled receptors, such that binding of GPR 
polypeptides or anti-idiotype antibodies (or fragments thereof) to 
G-protein coupled receptor ligands may be used for diagnostic 

35 research or therapeutic applications of the present invention. 
Such GPR polypeptides, antibodies or anti-idiotype antibodies of 
the present invention may therefore be used as modulators of 
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G-protein coupled receptors, such as neuroreceptors or endocrine 
receptors, as non- limiting examples. 

Binding of such GPR polypeptides, (including GPR 
fragments, consensus peptides, substitution derivatives and anti- 
5 idiotype ancibody fragments) of the present invention may be used 
to treat symptoms of, and provide diagnosis and treatment for, 
pathologies related to GPRs. Such pathologies have been found to 
correlate with symptoms occurring in neurological, viral or 
endocrine pathologies. D 2 receptor- related psychotic disorders, 

10 including schizophrenia, now treated with neuroleptics, is a non- 
limiting example thereof. 

The use of synthetic or recombinant GPR polypeptides of 
the present invention can be preferable to the use of known drugs 
that bind G-protein coupled receptors, such as neuroleptics that 

15 bind or inhibit the biological effect of binding to neuroreceptors 
as a non- limiting example. Such polypeptides are expected to have 
significantly less side effects than presently used drugs 
presently used for inhibiting such receptor binding including 
neuroleptics, as they would structurally mimic naturally occuring 

20 GPRs and/or modulate ligand binding. Thus, GPR polypeptides are 
expected to have reduced side effects attributable to known 
foreign compound drugs, with less immunogenicity , and reduced 
potential for motoric side effects {e.g., extrapyramidal symptoms 
and/or tardive dyskinesia} . 

25 The present invention is also related to the production, 

by chemical synthesis or recombinant DNA technology, of GPR 
polypeptides, preferably as small as possible while still 
retaining sufficiently high affinity or interaction with G-protein 
coupled receptors to modulate, such as to inhibit or to enhance, 

30 binding to such receptors by GPR ligands. 

GPR polypeptides of the present invent ion may include 5- 
10 to 50-150 amino acid fragments, consensus sequences or 
substitution sequences of GPRs, e.g., as presented in Fig. 8A-G 
(SEQ ID N0S:6-79) including, but not limited to, multiple dopamine 

35 receptors, camp receptors, adenosine receptors, /8-adrenergic 
receptors, muscarinic acetylcholine receptors, a-adrenergic 
receptors, serotonin receptors (5-HT), histamine H2 receptors. 
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thrombin receptors, kinin receptors, follicle stimulating hormone 
receptors, opsins and rhodopsins, odorant receptors, 
cytomegalovirus GPRs, adenosine A2 receptors, dopamine receptor, 
histamine H2 receptors, octopanmine receptors, N-formyl receptors, 
anaphylacoxin receptors, thromboxane receptors, IL-8 receptors, 
platelet activating factor receptors, endothelin receptors, 
bombesin gastrin releasing peptide receptor, neuromedin B 
preferring bombesin receptors, vasoactive intestinal peptides, 
neurotensin receptors, bradykinin receptors, thyrotropin -releasing 
hormone receptors, substance P receptors, neuromedin X receptors, 
adrenal angiotensen II type I receptors, mas oncogene 
(angiotensin) receptors lutropin- choriogonadotropin receptors, 
thyrotropin receptors, follicle stimulating hormone receptors, 
cannabinoid receptors, glucocorticoid- induced receptors, 
endothelial cell GPRs, testis GPRs , and thoracic aorta GPRs, and 
homologs thereof having a homology of at least 80S; with at least 
one of transmembrane domains 1-7, as described herein. See, e.g., 
Probst et al DNA and Cell Biology 11:1-20(1992}, which is entirely 
incorporated herein by reference. 

Accordingly, a "G-protein coupled receptor polypeptide" 
or "GPR polypeptide" of the present invention includes 
polypeptides having a "GPR amino acid sequence" which 
substantially corresponds to at least one 10 to 50 amino acid 
fragment and/or consensus sequence of a known GPR or group of 
GPRs, wherein the GPR polypeptide has homology of at least 80%, 
such as 81, 82, 83, 84, 85, 86, B7, 88, 89, 90, 91, 92, 93, 94, 
95, 96, 97, 98, 99 or 100% homology, while maintaining GPR 
modulating activity, wherein a GPR polypeptide of the present 
invention is not naturally occurring or is naturally occurring but 
is in a purified or isolated form which dpes noj: occur in nature. 
Preferably, a GPR polypeptide of the present invention 
substantially corresponds to a transmembrane domain of a GPR or 
group of GPRs as a consensus sequence. 

Also preferred are GPR polypeptides wherein the GPR amino 
acid sequence is 4-10 to 50 amino acids in length, such as 5, 6, 
7, 8, 9, 10, 11, 12, 13; 14, 15, 16, 17, 18, 19, "0, 21, 22/23,' 
24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39,' 



WO 94/05695 



PCT/US93/08528 



- 15 - 

40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 
110, 120, 13 0, 140 or 150 amino acids, or any range therein. 

An amino acid or nucleic acid sequence of a GPR 
polypeptide of the present invention is said to "substantially 
5 .correspond" to another amino acid or nucleic acid sequence, 

respectively, if the sequence of amino acids or nucleic acid in 
both molecules provides polypeptides having biological activity 
that is substantially similar, qualitatively or quantitatively, to 
the corresponding fragment of at least one GPR transmembrane 

10 domain, or which may be synergistic when two or more transmembrane 
domains, consensus sequences or homologs thereof are present. 

Additionally or alternatively, such "substantially 
corresponding" sequences of GPR polypeptides include conservative 
amino acid or nucleotide substitutions, or degenerate nucleotide 

15 codon substitutions wherein individual amino acid or nucleotide 
substitutions are well known in the art. 

Alternatively or additionally, substantially 
corresponding refers to GPR polypeptides having amino acid 
sequences having at least 80% homology or identity to an amino 

20 acid sequence of SEQ ID N0:1, such as 80, 81, 82, 83, 84, 85, 86, 
87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% 
homology or identity. 

Accordingly, GPR polypeptides of the present invention, 
or nucleic acid encoding therefor, include a finite set of 

25 substantially corresponding sequences as substitution peptides or 
polynucleotides which can be routinely obtained by one of ordinary 
skill in the art, without undue experimentation, based on the 
teachings and guidance presented herein . Por a detailed 
description of protein chemistry and structure, see Schulz, G.E. 

3 0 et al., Principles of Protein Structure, Springe r-Ver lag, New 
York, 1978, and Creighton, T.E., Proteins: Structure and 
Molecular Properties, W.H. Freeman & Co., San Francisco, 1983, 
which are hereby incorporated by reference. For a presentation of 
nucleotide sequence substitutions, such as codon preferences, see 

35 Ausubel et al, supra, at §§ A. l.l- A. 1.24, and Sambrook et al, 
supra, at Appendices C and D. 
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Conservative substitutions of a GPR polypeptide of the 
present invention includes a variant wherein at least one amino 
acid residue in the polypeptide has been conservatively replaced 
by a different amino acid. Such substitutions preferably are made 
5 in accordance with the following list as presented in Table IV, 
which substitutions may be determined by routine experimentation 
to provide modified structural and functional properties of a 
synthesized polypeptide molecule, while maintaining the receptor 
binding, inhibiting or mimicking biological activity, as 
10 determined by known GPR receptor activity assays. 



Table IV 



Original 


Exemplary 


Residue 


Substitution 


Ala 


Gly; Ser 


Arg 


Lys 


Asn 


Gin; His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Ala; Pro 


His 


Asn; Gin 


He 


Leu; Val 


Leu 


lie; Val 


Lys 


Arg; Gin; Glu 


Met 


Leu; Tyr; He 


Phe 


Met; Leu; Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



Alternatively, another group of substitutions of GPR 
polypeptides of the present invention are those in which at least one 
amino acid residue in the protein molecule has been removed and a 
different residue inserted in its place according to the following 
5 Table V. The types of substitutions which may be made in the 
protein or peptide molecule of the present invention may be based on 
analysis of the frequencies of amino acid changes between a 
homologous protein of different species, such as those presented in 
Table 1-2 of Schulz et al., supra and Figs. 3-9 of Creighton, supra . 
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Based on such an analysis, alternative conservative substitutions are 
defined herein as exchanges within one of the following five groups: 

TABLE V 

1. Small aliphatic, nonpolar or slightly polar residues: Ala, Ser, 
Thr (Pro, Gly) ; 

1. Polar, negatively charged residues and their amides: Asp, Asn 
Glu, Gin; 

3. Polar, positively charged residues: 
His, Arg, Lye; 

4. Large aliphatic, nonpolar residues: 
Met, Leu, He, Val (Cys) ; and 

5. Large aromatic residues: Phe, Tyr, Trp. 

The three amino acid residues in parentheses above have 
special roles in protein architecture. Gly is the only residue 
lacking any side chain and thus imparts flexibility to the chain. 
This however tends to promote the formation of secondary structure 
5 other than a-helical. Pro, because of its unusual geometry, tightly 
constrains the chain. It generally tends to promote 0- turn- like 
structures, although in some cases Cys can be capable of 
participating in disulfide bond £brmation which is important in 
protein folding. Note the Schulz et al. would merge Groups l and 2, 

10 above. Note also that Tyr, because of its hydrogen bonding 
potential, has significant kinship with Ser, and Thr, etc. 

Conservative amino acid substitutions according to the 
present invention, e.g., as presented above, are known in the art and 
would be expected to maintain biological and structural properties 

15 of the polypeptide after amino acid substitution. Most deletions and 
insertions, and substitutions according to the present invention are 
those which do not produce radical changes in the characteristics of 
the protein or peptide molecule. "Characteristics" is defined in a 
non- inclusive manner to define both changes in secondary structure, 

20 e.g. a-helix or |8-sheet, as well as changes in physiological 
activity, e.g. in receptor binding assays. 

However, when the exact effect of the substitution, 
deletion, or insertion is to be confirmed one skilled in the art will 
appreciate that the effect of the substitution or substitutions will 

25 be evaluated by routine screening assays, either immunoassays or 
bioassays to confirm biological activity, such as receptor binding 
or modulation of ligand binding to the corresponding GPR. See, e.g. , 
Maranges et al., eds., for example, a substituted polypeptide 
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typically is made by site-specific mutagenesis of the peptide 
molecule -encoding nucleic acid, expression of the mutant nucleic acid 
in recombinant cell culture, and, optionally, purification from the 
cell culture, for example, by immunoaf f inity chromatography using a 
5 specific antibody on a chemically derivatized column or immobilized 
membranes or hollow fibers (to absorb the mutant by binding to at 
least one epitope) . 

A preferred use of this invention is the production, by 
chemical or recombinant DNA technology, of GPR polypeptides, 
10 preferably as small as possible while still retaining sufficiently 
high affinity for binding to, or association with, GPRs. By 
production of GPR polypeptides including smaller fragments or 
variants of such transmembrane domains, one skilled in the art, using 
known binding and inhibition assays, can readily identify the GPR 
15 polypeptides capable of binding minimizing or modulating G-protein 
coupled receptors using known methods. Non-limiting examples of 

fragments of GPRs to be used as GPR polypeptides or as a basis for 
consensus sequences thereof for GPR polypeptides, are presented in 
Figs. 2-5 and Fig. 8A-G, wherein fragments or consensus sequences of 
20 10 to 50 amino acids of at least one sequence of Figs. 2-5 or 
corresponding to at least one transmembrane domain or domains 1-7 
listed in Fig. 8A-G {SEQ ID NOS:6-79) are encompassed by the present 
invention, such as at least one transmembrane domain of one or more 
GPRs, such as a cAMP receptor (1), adenosine receptors (2-3); 
25 muscarinic acetylcholine receptors {4-8} ; human adrenergic receptors 
(9-ll, 14-16, 19-25, 28),- adrenergic receptors (9-28); human thrombin 
receptor (31); endothelin receptors (35-36) , bombesin receptors (37- 
3B), endocrine receptors (48-50), rhodopsin (51). opsins (52-54), 
odorant receptors (55-64), and cytomegalovirus GPRs (72-54), as non- 
30 limiting examples, wherein (»#-) refers to the listed sequences in 
Fig. 8A-G. 

Accordingly, GPR polypeptides may include consensus 
sequences and/or fragments of at least one of transmembrane domain 
1-7 of one or more GPRs as presented in Figs. 2-5 (SEQ ID NO : 2 - 5 ) or 
3 5 Fig. 8A-G. (SEQ ID NOS:S-79} or homologs thereof, which GPR 
polypeptides do not occur naturally, and/or which are provided in an 
isolated and/or purified form not found in nature. 
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Consensus peptides of GPR polypeptides of the present 
invention may include peptides which are distinct from known GPR 
sequences in critical structural features, but which are derived from 
consensus sequences of homologous GPR transmembzane domains 1-7, 
e.g., as presented in Fig. 8A-G (SEQ ID NOS:6-79). Such consensus 
peptides may be derived by molecular modeling, optionally combined 
with hydrophobicity analysis and/or fitting to model helices, as 
non- limiting examples. Such modeling can be accomplished according 
to known method steps using known modeling algorithms, such as, but 
not limited to, ECEPP, INSIGHT, DISCOVER, CHEM-DRAW, AMBER, FRODO and 
CHEM-x. Such algorithms compare transmembrane domains between 
related G-protein coupled receptors, determine probable energy- 
miminized structures and define alternative consensus polypeptide 
fragments . 

Such consensus peptides or fragments of GPRs may then be 
synthesized or produced recombinantly , in order to provide GPR 
polypeptides according to the present invention which mimic, modulate 
or inhibit binding of ligands to G-protein coupled receptors. GPR 
ligands, in the context of the present invention, refer to biological 
molecules that bind GPRs in vitro, in situ or in vivo, and may 
include hormones, neurotransmitters, viruses or receptor binding 
domains, thereof, opsins, rhodopsins, nucleosides, nucleotides, 
coagulation cascade factors, odorants or pheremones, toxins, colony 
stimulating factors, platelet activating factors, neuroactive 
peptides, neurohumors, or any biologically active compounds, such as 
drugs or synthetic or naturally occurring compounds. 

The following non- limiting examples of consensus peptides 
of GPRs of the present invention are provided by way of guidance and 
not by way of limitation. In GPR polypeptides of the present 
invention, one or more, preferably 4-10, Asp and/or Lys residues may 
additionally be incorporated at the carboxy and/or amino terminal 
ends in order to provide expected helix forming effects of the helix 
dipole effect, e.g., as described in Baldwin et al Biochem. 23:2130 
(1989); Baldwin et al Proc. Nat'l Acad. Sci. USA 84:8898 (1987); and 
Baldwin et al Proc. Nat'l Acad. Sci. USA 86:5286 (1989), which 
references are entirely incorporated herein by reference. 
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As a non- limiting example of GPR polypeptide of the present 
invention, dopamine receptor transmembrane fragments of D, 
transmembrane domain (e.g., domain III) as presented in Fig. 2 {SEQ 
ID NO: 2) or a consensus sequence as presented in Pig. 3 (SEQ ID 
5 KT0:3), e.g., of D 2 domains I- VII. Additionally or alternatively a 
consensus sequence may include less than 20 amino acids, such as 15 
amino acids corresponding to a transmembrane domain, such as a D, 
receptor domain, as presented in Fig. 4 (SEQ ID NO: 4) as polypeptide 
IV, which is smaller than the length required by spanning an average 

10 lipid bilayer of a cell membrane. 

However, in the context of the present invention, GPR 
polypeptides of greater than 15 -20 amino acids are preferred such 
that the Gpr polypeptides are able to span the lipid bilayer. 

Another non- limiting example of a GPR polypeptide using 

15 dopamine receptor transmembrane domains is a consensus sequence of 
two or more GPR receptors, such as the dopamine D, and D 2 receptors. 
A non- limiting example of such a consensus GPR polypeptide is 
presented in Fig. 5 (SEQ ID NO: 5) . 

Additionally, modified amino acids or chemical derivatives 

20 of amino acids of consensus or fragments of GPRs proteins, according 
to the present invention may be provided, which polypeptides contain 
additional chemical moieties or modified amino acids not normally a 
part of the protein. Covalent modifications of the peptide are thus 
included within the scope of the present invention. such 

25 modifications may be introduced into a GPR polypeptide by reacting 
targeted amino acid residues of the polypeptide with an organic 
derivatizing agent that is capable of reacting with selected side 
chains or terminal residues. The following examples of chemical 
derivatives are provided by way of illustration and not by way of 

30 limitation. 

Aromatic amino acids may be replaced with D- or 
L-naphylalanine, D- or L-Phenylglycine, D- or L-2-thieneylalanine, 
D- or L-1-, 2-, 3- or 4-pyreneylalanine, D- or L-3- thieneylalanine, 
D- or L- (2 -pyridinyl) -alanine, D- or L- { 3 -pyridine 1 ) -alanine, D- or 

35 l- (2 -pyrazinyl) -alanine, D- or L- (4-isopropyl) -phenylglycine, 
D- (trif luoromethyl) -phenylglycine, D- (trif luoromethyl) -phenylalanine, 
D-p-fluorophenylalanine, D- or L-p-biphenylphenylalanine, D- or 
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L-p-methoxybiphenylphenylalanine, D- or L- 2 -indole (alkyl) alanines, 
and D- or L-alkylainines where alkyl may be substituted or 
unsubstituted methyl, ethyl, propyl, hexyl, butyl, pentyl, iso- 
propyl, iso-butyl, sec-isotyl, iso-pentyl, non-acidic amino acids, 
5 of C1-C20. 

Acidic amino acids can be substituted with non- carboxylate 
amino acids while maintaining a negative charge, and derivatives or 
analogs thereof, such as the non-limiting examples of (phosphono) - 
alanine, glycine, leucine, isoleucine, threonine, or serine; or 

10 sulfated (e.g., -S0 3 H} threonine, serine, tyrosine. 

Other substitutions may include unnatural hyroxylated amino 
acids may made by combining "alkyl" (as defined and exemplified 
herein) with any natural amino acid. Basic amino acids may be 
substituted with alkyl groups at any position of the naturally 

15 occurring amino acids lysine, arginine, ornithine, citrulline, or 
(guanidino) -acetic acid, or other (guanidino) alkyl -acetic acids, 
where "alkyl" is define as above. Nitrile derivatives (e.g., 
containing the CN-moiety in place of COOH) may also be substituted 
for asparagine or glutamine, and methionine sulfoxide may be 

20 substituted for methionine. Methods of preparation of such peptide 
derivatives are well known to one skilled in the art. 

In addition, any amide linkage in any of the GPR 
polypeptides can be replaced by a ketomethylene moiety, e.g. (-C(=0) - 
CHj-) for ( - (C=0) -NH- ) . Such derivatives are expected to have the 

25 property of increased stability to degradation by enzymes, and 
therefore possess advantages for the formulation of compounds which 
may have increased in vivo half lives, as administered by oral, 
intravenous, intramuscular, intraperitoneal, topical, rectal, 
intraocular, or other routes. 

30 in addition, any amino acid representing a component of the 

said peptides can be replaced by the same amino acid but of the 
opposite chirality. Thus, any amino acid naturally occurring in the 
L- configuration (which may also be referred to as the R or S, 
depending upon the structure of the chemical entity) may be replaced 

35 with an amino acid of the same chemical structural type, but of the 
opposite chirality, generally referred to as the D- amino acid but 
which can additionally be referred to as the R- or the S-, depending 
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upon its composition and chemical configuration. Such derivatives 
have the property of greatly increased stability to degradation by 
enzymes, and therefore are advantageous in the formulation of 
compounds which may have longer in vivo half lives, when 
5 administered by oral, intravenous, intramuscular, intraperitoneal, 
topical, rectal, intraocular, or other routes. 

Additional amino acid modifications of amino acids of GPR 
polypeptides of to the present invention may include the following: 
Cysteinyl residues may be reacted with alpha-haloacetates (and 

10 corresponding amines), such as 2-chloroacetic acid or 
chloroacetamide, to give carboxymethyl or carboxyamidomethyl 
derivatives. Cysteinyl residues may also be derivatized by reaction 
with compounds such as bromotrif luoroacetone, alpha-bromo- 
beta- (5- imidozoyl) propionic acid, chloroacetyl phosphate, 

15 N-alkylmaleimides, 3 -nitro-2-pyridyl disulfide, methyl 2-pyridyl 
disulfide, p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol J 
or chloro- 7 -ni trobenzo - 2 - oxa- 1,3- diazole . 

Histidyl residues may be derivatized by reaction with 
compounds such as diethylprocarbonate e.g., at pH 5.5-7.0 because 

20 this agent is relatively specific for the histidyl side chain, and 
para-bromophenacyl bromide may also be used; e.g. , where the reaction 
is preferably performed in 0.1 M sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues may be reacted with 
compounds such as succinic or other carboxylic acid anhydrides. 

25 Derivatization with these agents is expected to have the effect of 
reversing the charge of the lysinyl residues. Other suitable 
reagents for derivatizing alpha -amino -containing residues include 
compounds such as imidoesters/e.g. , as methyl picolinimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride ; 

30 trinitrobenzenesulfonic acid; O-methylisourea; 2,4 pentanedione ; and 
transaminase -catalyzed reaction with glyoxylate. 

Arginyl residues may be modified by reaction with one or 
several conventional reagents, among them phenylglyoxal , 
2,3-butanedione, 1,2-cyclohexanedione, and ninhydrin according to 

35 known method steps. Derivatization of arginine residues requires 
that the reaction be performed in alkaline conditions because of the 
high pKa of the guanidine functional group. Furthermore, these 
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reagents may react with the groups of lysine as well as the arginine 
epsilon- amino group. 

The specific modification of tyroayl residues per se is 
well-known, such as for introducing spectral labels into tyrosyl 
5 residues by reaction with aromatic diazonium compounds or 
tetranitromethane. N-acetylimidizol and tetranitromethane may be 
used to form 0-acetyl tyrosyl species and 3-nitro derivatives, 
respectively. 

Carboxyl side groups (aspartyl or glutamyl) may be selec- 

10 tively modified by reaction with carbodiimides (R» N-C-N-R' ) such as 
l-cyclohexyl-3- (2-morpholinyl- (4-ethyl) carbodiimide or l- 
ethyl-3- (4-azonia-4,4- dimethylpentyl ) carbodiimide. Furthermore, 
aspartyl and glutamyl residues may be converted to asparaginyl and 
glutaminyl residues by reaction with ammonium ions. 

15 Glutaminyl and asparaginyl residues may be frequently 

deamidated to the corresponding glutamyl and aspartyl residues. 
Alternatively, these residues may be deamidated under mildly acidic 
conditions. Either form of these residues falls within the scope of 
the present invention. 

20 Derivatization with bifunctional agents is useful for 

cross- linking the peptide to a water- insoluble support matrix or to 
other macromolecular carriers, according to known method steps. 
Commonly used cross - linking agents include, e.g., 
1, l-bis (diazoacetyl) -2-phenylethane, glutaraldehyde , 

25 N- hydroxysuccininri.de esters, for example, esters with 
4-azidosalicylic acid, homobi functional imidoesters, including 
disuccinimidyl esters such as 3,3'- 
dithiobis (succinimidylpropionate) , and bifunctional maleimides such 
as bis-N-maleimido-l, 8-octane. Derivatizing agents such as 

30 methyl -3 - [ (p-azidophenyl) dithio] propioimidate yield photoactivatable 
intermediates that are capable of forming crosslinks in the presence 
of light. Alternatively, reactive water- insoluble matrices such as 
cyanogen bromide -activated carbohydrates and the reactive substrates 
described in U.S. Patent Nos. 3,969,287; 3,691,016; 4,195,128; 

35 4,247,642; 4,229,537; and 4,330,440 (which are herein incorporated 
entirely by reference) , may be employed for protein immobilization. 
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Other modifications of GPR polypeptides of the present 
invention may include hydroxylation of proline and lysine, 
phosphorylation of hydroxyl groups of seryl or threanyl residues, 
methylation of the alpha -amino groups of lysine, arginine, and 
5 histidine side chains (T.E. Creighton, Proteins: Structure and 
Molecule Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 
(1983)), acetylation of the N-terminal amine, methylation of main 
chain amide residues {or substitution with N-methyl amino acids) and, 
in some instances, amidation of the C- terminal carboxyl groups, 

10 according to known method steps. 

Such derivatized moieties may improve the solubility, 
absorption, permeability across the blood brain barrier biological 
half life, and the like. Such moieties or modifications of GPR 
polypeptides may alternatively eliminate or attenuate any possible 

15 undesirable side effect of the protein and the like. Moieties 
capable of mediating such effects are disclosed for example, in 
.Remington's Pharmaceutical Sciences, 16th ed. , Mack Publishing Co., 
Easton, PA (1S80) . 

Such chemical derivatives of GPR polypeptides also may 

20 provide attachment to solid supports, including but not limited to, 
agarose, cellulose, hollow fibers, or other polymeric carbohydrates 
such as agarose, cellulose, such as for purification, generation of 
antibodies or cloning; or to provide altered physical properties, 
such as resistance to enzymatic degradation or increased binding 

25 affinity or modulation for GPRs, which is desired for therapeutic 
compositions comprising GPR polypeptides, antibodies thereto or 
fragments thereof. Such peptide derivatives are well-known in the 
art, as well as method steps for making such derivatives using 
carbodiimides active esters of N- hydroxy succinimmide, or mixed 

30 anhydrides, as non- limiting examples. 

Variation upon consensus peptide sequences of GPR 
polypeptide of the present invention may also include: the addition 
of one, two, three, four, or five lysine, arginine or other basic 
residues added to the -COOH terminal end of the peptide; and/ or one, 

35 two, three, four, or five glutamate or aspartate or other acidic 
residues added to the amino terminal end of the peptide, where 
"acidic" and "basic" are as defined herein. Such modifications are 
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well known to increase the a-helical content of the peptide by the 
"helix dipole effect". They also can provide enhanced aqueous 
solubility of the peptide. See, e.g., Baldwin et al., supra 

As another non- limiting example of a GPR polypeptide of the 
5 present invention, serotonergic receptors (5-HT) consensus sequences 
may be determined using presently known 5-HT sequences and include, 
e.g., as consensus peptides of TM3 , TM5 and TM7, •/espectively : 

5-HT consensus (1) DDDDNIWSIFDWIGYLNSISMVIYTLFKKKK (SEQ ID NO: SO) 
5-HT Consensus (2) DDDDNIWHI FSTI GYLNS IS PVS VIMHIYGKKKK (SEQ ID NO: 81) 
10 5-HT consensus (3> DDDDGYSIYDTLVTFAINPVYITVFKKKK (SEQ ID IK): 82) 

Such non- naturally occurring consensus sequences may also 
be further modified according to known method steps to provide 
additional consensus peptides with substituted amino acids to 
increase or decrease a-helical propensity and/or solubility {e.g., 
15 hydrophilicity) . As a non -limiting example, 5-HT consensus peptide 
(1) above may be modified according to the present invention to have 
increase helical propensity and increased aqueous solubility as 
follows : 

5-HT consensus (4) DDDDNAWSAFDWALYLNSISMAIYTYAKKKK (SEQ ID N0:83), 

20 wherein, e.g., smaller, non-polar residues replace either larger, 
more polar residues {e.g., Ala for lie or Val) or larger aromatic 
residues {e.g., Ala for Phe) . 

Another non-limiting, illustrative example of consensus GPR 
polypeptides of the present invention are those for adrenergic 
25 receptors, are the following: 

An example of the consensus GPR polypeptide for 
domain VII across all presently known adrenergic receptors is as 
follows : 

adrenergic consensus (I) LFSFITWLGYAHSSLNPIIYTTF (SEQ ID NO:84) 

3 0 An example of a consensus GPR polypeptide for domain V 

across all adrenergic receptors is as follows: 
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adrenergic consensus (2) VYTIYSSSWFFAPSIAIMVITYT (SEQ ID HO: 85) 

Examples of a consensus GPR polypeptide for 
domain III across all adrenergic receptors are as follows: 

adrenergic consensus (3) IWLTSDIHSTSSILHNLCVISF (SEQ ID NO: 8b) 

An example of a consensus GPR polypeptide for domains III, 
V, and VII of all adrenergic receptors is as f oIIowb : 

adrenergic consensus (4) IWSIFSSDIWGYANHSSIAIHCPIVTYTV {SEQ ID NO: 87) 
adrenergic consensus (5) IFTIFSSDIAVGYANHSSAAIMPIVIYSV (SEQ ID NO:88), 

Wherein variations and substitutions of amino acids may be made as 
described herein. 

Non- limiting examples of consensus GPR polypeptides for 
transmembrane domain III across several or many, such as 1-500, or 
any range or value therein, G-protein receptors are as follows: 

TM3-U) YAI FVLYASAWLS FLNCP FI VTLNI {SEQ ID NO:96) 
TM3-(2) YAXFVLYATAWLSFLNGPFIVTLNI (SEQ ID NO: 97} 
TM3-(3) YAIFVLYATAWLTFLNCPFIVTLNI (SEQ ID N0:98) 
TM3-(4) YAIFVLYASAWLTFLNCPFIVTLNI {SEQ ID NO: 99) 
TM3 - ( 5 > WAIFVT.YASAWLSFLNCPFIVTLNI (SEQ ID NO: 100) 
TM3-(6> WAIFVLYAIAWLSFLNCPFIVTLNKSEQ ID NO: 101) 
TM3-(7) WAIFVLYAIAWLTFLNCPFIVTLNI (SEQ ID NO:102) 
TM3-<8) WAIFVLYASAWLTFLNCPFIVTLNI (SEQ ID NO:103) 
TM3-<9) YAVFVXYASAHLSFLNMPFIVTLNI (SEQ ID NO: 104) 
TM3-(10) YAVFVLYATAWLSFLNMPFIVTLNI (SEQ ID NG:105) 
TM3-(11) YAVFVLYATAMLTFLNMPFIVTIiNI (SEQ ID NO: 106) 
TM3- (12) YAVFVLYASAMLTFLNMPFIVTLNI (SEQ ID NO: 107) 
TM3-(13) YAI FVLYASAWLS FLNCVTAS I PFIVTLNI (SEQ ID NO:108) 
TIG- (14) YAI FVLYASAWLSFLNCTS S I WTAS IVTLNI (SEQ ID NO: 109) 
TM3-(15) YAIFVLYASAWLSFLNVTLNICTSSIV (SEQ ID NO: 110) 
TIG- (16) YAIFVLYASAWLSFLNTASIIiNLMFIVTIiNI (SEQ ID NO: 111) 
TM3-(17> YAIFVLYASAWLSFLNMASILNLPFIVTLNI (SEQ ID NO: 112) 
TM3-(18) YAIFVLYASAWLSFIJJSGIIiIJ^FrVTLNI(SEQ ID NO:113) 
TM3 -(19) YAIFVLYASAWLSFLNMSGILLLAPFIVTLNI (SEQ ID N0:114) 
TM3-(20) YAIFVLYASAWLSFLNSELSVYTLTVCPFIVTLNI (SEQ ID NOillS) 
TIC- (21) YAIFVLYASAWLSFLNMSELSVYTLTVPFIVTLNI (SEQ ID NO: 116) 
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1M3- (22) YAIFVLYASAWIiASELSVYTLWSFLNCPFIVTLNI (SEQ ID HO:117) 

TM3-{23) YAI FVLYASAWLAS ELSVYTLTVP F IVTLNI (SEQ 10 HO: 218) 

TM3-(24> YAIFVLYASAWLSFLASELSVYASELSSTLTTVNMPFIVTLNI (SEQ ID NO: 119) 

TM3 - ( 2 5 > YAIFVLYASAWLSFLNGGEIALWSLCPFrVTLNI (SEQ ID NO: 120) 

5 TM3 - ( 2 6 > YAIFVLYASAWLSFLNGGEIALWSLTVTLHI (SEQ ID HO: 121} 

TM3- (27) YAIFVLYASAWIjGGEIALWSLNCPFIVTLNI (SEQ ID HO: 122) 

TM3-(28) YAIFVLYAGGEIALWSLSFLNCPFIVTLNI(SEQ ID NO: 12 3) 

TM3-(29) YAIFVLYASAWLSFFFLLFGYLGHFLLHCPFIVTLNKSEQ ID NO: 124} 

TM3- (30) YAI FVLYASAWLFFFLLFGYLGNFLLPFIVTLNI (SEQ ID NO: 125) 

10 TM3-(31) YAIFVLYASAWLSFIiNTACFYVAITASLCFITEIAIjIPFIVTIjNI (SEQ ID NO: 12 6} 

TM3-(32) YAIFVLYASAWLTACFYVAITASLCFITEIALICPFIVTLNI (SEQ ID NO: 12 7) 

TM3-(33) YAIFVLYATACFYVAITASLCFITEIALISFLNCPFIVTLNI (SEQ ID NO:12e> 

TM3-(34> YAITACFYVAITASIiCFITEIALIASAWLSFLNCPFIVTLNI (SEQ ID NO: 129) 

TM3-(35> YAIFVLYATACFYVAIITEIALISAWLSFLNCPFIVTIiNI(SEQ ID NO:130) 

15 TM3-(3S) YAIEVLYASAWLSFLNACFYICLFAGVCFLIPFIVTLNI (SEQ ID NO: 131) 

TM3 - ( 3 7 ) YAI FVLYASAWNACFYICLFAGVMFLILSFLNCPFIVTLNI (SEQ ID NO:132) 

IMS - ( 3 8 ) YAIFVLYFYICIiFAGVCFLIASAWLSFLNCPFIVTLNI {SEQ ID NO: 133) 

TM3-(39) YAIFVLYASVDAVNMFTSAWLSFLNCPFIVTLNI (SEQ ID NOH34) 

TH3-(40) YAI FSVDAVNKFTVLYASAWLS FLNCP FTVTLNI (SEQ ID NO:135) 

2 0 TH3-(41> YAIFVLYASAWLSVDAVNMFTSFLNCPFrVTIiNI (SEQ ID NO: 136) 

TM3 - (42 ) YAIFVLYASAWLSFLNSVDAVNMFTPFIVTLNI (SEQ ID NO: 137) 

TM3-(43) YAIFVLYASAWI1SFI1NCPFIVSVDAVNMFTTLNI ( SEQ ID NO: 133) 

TM3-(44) YAIFVLYASAWLSVDMFTSFLNCPFIVTLNI (SEQ ID NO: 139) 

TM3-(45) YAISVDAVNMFTFVLYASAMLSFLNCPFIVTLNKSEQ ID NO:140) 

25 TM3-(46> YAIFSLSVFSLLAIVLYASAWLSFLNCPFIVTLNI (SEQ ID NO : 14 1 5 

TM3-(47) YAIFVLYASLSVFSILAISAWLSFLNCPFIVTLNI (SEQ ID NO:142) 

TM3-(4fl) YAIFVLYASAWLSLSVPSLLAISFLNCPFIVTLNI (SEQ ID NO: 143) 

TM3- (49) YAIFVLYASAWLSFLSLSVFSLIAINCPFIVTLNKSEQ ID NO: 144) 

TM3-(S0) YAIFVLYASAWLSFLNPFSLSWSLLAIIIVTLNI (SEQ ID NO:145) 

30 TM3-(51) YAIFVLYATAWLTFLNCVTATIPFIVTLNI (SEQ ID N0:146) 

TM3-(52) YAIFVLYATAWLSFLNCTSSIVVTATIVTLNI(SEQ ID NO:147) 

TM3-(53) YAIFVLYATAWI^FIJSrmiNICTTTIVtSEQ ID NO: 14 8) 

TM3-(54) YAIFVLYATAWLTFLNTAT I LNLMFIVTLNI (SEQ ID NO:149> 

TM3-(55) YAIFVLYATATOSFLNMa.TILNLPFIVTLNI(SEQ ID NO: 150) 

35 TIC- (55) YAIFVLYATAWLTFI^SGILLLAPFlVTLNItSEQ ID NO: 151) 

TM3-(57) YAIFVLYASAWLTFLNMTGILLLAPFIVTIiNI ( SEQ ID NO: 152) 

TM3 - ( SS ) YAI FVLYASAWLTFLNTELTVYTLTVCP F IVTLNI ( SEQ ID NO : 153 ) 

TM3- (59) YAIFVLYASAWLTFIiNKTELTVYTLTVPFIVTINItSEQ ID NO: 154) 

TM3- (SO) YAIFVLYATAWIATELTVYTLTVTFLNCPFIVTLNI (SEQ ID NO: 155) 

40 TM3-<61> YAIFVLYASAWIATELSVYTLTVPFIVTLNI ( SEQ ID NO: 156) 

TM3-(62> YAIFVLYATAmjSFIATELSVYASELSTTLTTVNMPFIVTLNI (SEQ ID NO: 157) 

TM3-{63> YAI FVLYATAWLSFLNGGE1ALNTLCPF IVTLNI (SEQ ID NO: 158) 

TM3-(G4> YAI FVLYASAMLTFLNGGE IALWTL IVTLNI (SEQ ID NO: 159) 

TM3-(65) YAIFVLYASAWLGGEIALWTLNCPFIVTLNI (SEQ ID NO: 160) 

45 TM3-<66) YAIFVLYAGGEIALWTLSFLNCPFIVTLNKSEQ ID NO: 161) 
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TM3-(S7) YAIFVLYATAWLSFFFLLFGYLGHFLLNCPFIVTLNI (SEQ ID HO: 152) 
TM3 -(58) YAI FVLYATAWLFFFLLFGYLGNFLLPFIVTLNI ( SBQ ID NO: 3.63) 
TM3-<69) YAIFVLYATAWLTFLHTACFYVAITASLCFITEIALIPFIVTLNI (SEQ ID NO: 164) 
TM3-(70> YAIFYLYATAWLTACFYVAITATLCFITEIALICPFIVTLNI (SEQ ID NO: 165) 
5 TH3- (71) YAIFV1YATACFYVAITATLCFITEIALISFLNCPFIVTLNI (SBQ ID NO:166) 
TM3-<72) YAITACFYVAITASLCFITEIALIATAWLTFLNCPFIVTLNKSEQ ID NO: 167) 
TM3-(73) YAIFVLYATACFYVAIITEIALITAWLTFLNCPFIVTLNI (SEQ ID NO:168) 
1W3-(74) YAIFVLYASAHLTFLNACFYICLFAGVCFLIPFIVTIiNKSEQ ID NO: 169) 
TM3-(75) YAIFVLYASAWNACFYICLFAGVMFLILTFLNCPFIVTLNI {SEQ ID NO: 170) 
10 TM3-{75) YAIFVLYFYICLFAGVCFLIATAWLTFLNCPFIVTLNI (SEQ ID KO:171) 
TM3-<77) YAIFVLYATVDAVNMFTTAMLTFLfICPFIVTLNI(SEQ ID NO: 172) 
TM3-(78> YAIFTVDAVNMFTVLYATAWLTFLNCPFIVTLNI (SEQ ID NO: 173) 
TM3-(79) YAIFVLYATAWLTVDAVNMFTSFLNCPFIVTLNKSEQ ID NO: 174) 
TM3-(80) YAIFVLYATAWLSFLNTVDAVNMFTPFIVTLNI (SEQ ID NO: 175) 
15 TM3-(81) YAIFVLYA3AWLTFLNCPFIVSVDAVNMFTTLNI (SEQ ID NO: 176) 
TM3-(82> YAIFVLYATAWLSVDMFTTFLNCPFIVTLNKSEQ ID NO:177) 
TM3- (83) YAISVDAVNMFTFVLYATAWLSFLNCPFrVTLNI (SEQ ID NO:178) 
TM3-(84) YAIFVLYASLTVFSLLAISAMLTFLNCPFrVTLNI (SEQ ID NO: 179) 
TM3-(85) YAIFVLYASAWLTLSVFTLIAISFLNCPFIVTLNKSEQ ID NO: 180) 
20 TK3-(86) YAIFVLYASAWLTFLSLSVFT1LAINCPFTVTLNI (SEQ ID NO:181) 
TK3-(87) YAIFVLYASAWLTFLNPFSLSVFSLLAIIVTLNKSEQ ID NO: 182) 
TM3- (88) YAIFVLYASAWLS FLNLGGVTASFTASVGPFIVTLNI (SEQ ID NO:183) 
Ha- (89) YAIFVLYASAWLSFLNU3GVTASFTASVGVTLNI (SEQ ID NO: 184) 
TH3-O0) YAIFVLI^VTASFTASVNYASAWLSFLNCPFIVTLNI(SEQ ID NO: 185) 
25 TM3-(31) YAIFVLYAIFFFLLFSAWLSFLNCPFIVTLNI (SEQ ID NO: 166) 

TM3 - ( 9 2 ) YAIFVLYASAWLSFLNCPFI VTLNI I FFFLLFIVTLNI (SEQ ID NO : 1 8 7 ) 
TM3-(93} YAIFVLYASAWIFFFLLFLSFLNCPFIVTLNI (SEQ ID NO: 188) 
TM3-(94) YAIFVLYASAin<FFTVLASELSVYTLTVSFLNCPFIVrLNI(SEQ ID HO:189) 
TK3-(95) YAIFVLYASAWLSFDFATLGGEIALCPFIVTLNKSEQ ID ND:190) 
30 TM3-(96) YAIFVLYAFATI/SGEIALSAWLSFliNCPFIVTLNI (SEQ ID NO: 191) 

TM3-{97) YAIFFTVIASELSVYTLTVYASAWLSFLNCPFIVTLNI (SEQ ID NO:132) 
TM3-(98) YAIFFPIAALFASIASAWLSFLNCPFIVTLNT (SEQ ID NO:193) 
TM3-(93) YAIFVLYASAWLSFFPIAALFASIPFIVTLNI (SEQ ID NO: 194) 
IM3-(100) YAIFVLYASAWLSFLNCPFFPIAALFASILNI (SEQ ID NO: 195) 
35 TM3- (101) YAIFVLYASAWIiSLDVLFSTASIMHLSFLNGGEIALWSLIVTLNI (SBQ ID NO:196) 
1W3-<102) YAIFVLYASIJJVLFSTASIMHLIAIiWSIJJCPFIVTLNI(SEQ ID NO: 19 7) 
TM3-U03) YAIFVLYAGGEIALWSLSFLNSIDVIFSTASIMHLPFIVTLNKSEQ ID NO: 198) 
TbB-(104) YAIFVLYASAWLSFFDVLFSTASIMHLFGYLGNFLLNCPFIVTLNKSEQ ID NO: 199) 
TM3-U05) YAI FVLYASAWLF FFLLFQYLS 1DVL FS TAS IMHLGNFLLPF IVTLNI (SEQ ID NO;200) 
40 TM3-(106) YAIFVLYASAWLSFLNTACFYVAITASLSLMHLFITEIALIPFIVTLNI ( SEQ ID N0:201) 
TM3-I107) YASLDULFSTAIMHLSAWLTACFYVAITASLCFITEIALICPFIVTLNI (SEQ ID NO:202) 
TM3-(108) YAIFVLYATACFYVAITASLSFLNCPFIVTLNISLDVLFSTASIMHL ( SEQ ID NO:203) 
TM3-U09) YAITACFYVAITASLCFITEIALIASAWLSFLNCPFIVTLNI (SEQ ID NO: 204) 
TM3-U10) YAI FVLYATACFYSTAS ILNLIMHLCAI SLVAI ITEIALI SAWLS FLN (SEQ ID NO:205) 
45 TtD-(lll) YAIFVLYASAPfLSFLNACFYICLFASILNLIMHLCSVCFLIPFITTLNI (ijEQ ID NO:206) 
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TM3-{112) YAIFVLYASAffllASlLNLIMHIiCFYICLFAGVMLILSFIiNCPFIVTLNI {SEQ ID NO:207) 
TM3-(113) YAIFPFVQCVVSIFSLVLIAVVLYFYIAGVCFLIASAWLSFLNCPFIVTI (SEQ ID NO: 208 
TM3-CL14) PFVQCVE ITVS IFSLVIjXAVYAI FVLYASVDAVNMFTSAWCPFIVTLNI ( SEQ ID NO: 209! 
TM3 -(115) YAI FGDWSSVDAVNMFTVLYASAWLSFLNCPFIVTLNI (SEQ ID NO : 210) 
TM3-U16) YAIFVLYAGDWSSAWLSVDAVNMFTSFLNCFFIVTLNI (SEQ ID NO: 211) 
TM3-U17) YAIFVLYASAWLGDWSSFLNSVDAVNMFTPFrVTLNI (SEQ ID NO:212) 
TM3-U18) YAIFVLYASAWLSFLNCPFIVQDWSSVDAVNMFTTLNI (SEQ ID NO:213) 
TM3-(H9) YAIFVLYASAWLGYLGSVDKFTSFLNCPFIVTGDWSLNI (SEQ ID NO:214) 
TM3 -(120) YAISVDAVNMFTFVLYA3YLQSAWLSFLNCPFI VTLNI {SEQ ID NO: 215) 
TM3-(121) YAIFSLSVFSLIAIVLYASAWLGYLGSFLNCPPIVTLNI (SEQ ID NO:216) 
TM3-(122) YAIFVLYAGYlQAGNMDSLSVFSLIAISAWLSFIiNCPFIVTLNI [SEQ ID NO:217) 
TM3-(123) YAIFVLYASAWLSLSVFGNMSIilAISFLNCPFIVTLNI (SEQ ID NO:218) 
TM3-(124) YAI FVLYASAWLS FLSLSVFGGSLLAINCPFIVTLNI (SEQ ID NO: 2 19) 
TH3-U25) YAI FVLYASAWLS FLNP FSLSVFGSLLAI IVTLNI (SEQ ID NO: 220) 
TM3-<126) YAIFVLYATAWLTFLSLANCVTATIPFIVTLNI (SEQ ID NO:221) 
TM3-U27) YAIFVLYATAWLSFLNCTSIASSIVVTATIVTLNI (SEQ ID NO:222; 
TM3-U28) YAIFVLYATAWLSFLNVTLNI SIAC^TTIV ( SEQ ID 110:223) 
TK3-(129) YAIFVLYATAWLTFLNTATILSLANLMFIVTLNI (SEQ ID NO:224) 
TK3-(130) YAIFVLYATAWLSFLNKATILNLPFSVDAVIVTLNI (SEQ ID NO: 22 5) 



2 0 Recently discovered G-proteins also can be used according 

to the presently claimed invention to provide GPR polypeptides of the 
present invention, based on the teaching and guidance presented 
herein. Exampled of such GPR polypeptides of the present invention 
may include, as non- limiting examples, GPR polypeptides corresponding 

25 to transmembrane domain III, e.g., as follows: 



TM3-(131> ISTMYIVTGRWTLGQVVCDFWLSSDITCCTASILHLCVTAL (SEQ ID NO:226) 

HO- (132) ILYGYRWPLPSKLCAVWIYLDVLFSTASIMHLCAISL (SEQ ID NO:227) 

TM3-(133) IIYI VMDRWKLGYFLCEVWLSVDMTCCTCS ILHLCVIAL {SEQ ID NO: 228) 

TM3- (134) IADKTVRVAKGAENDLGYNFHSDDVCGHCWQWYCSI. (SEQ ID NO:229) 

3 0 TM3-(135> I LNYWPFGLALCHFVNYSQAVS VLVSAYTLVAI S I (SEQ ID NO: 230) 

TM3- (136) ILGRWEFGIHLCKLWLTCDVLCCTSSILNLCAIALD (SEQ ID NO:231) 

TM3- (137) IKASVMHRHCLPLIGICLSSEHHCLVSIFVELGAL (SEQ ID NO:232) 

Further non- limiting examples of consensus GPR polypeptides 
for transmembrane domain III of several or many, such as 1-500, or 
35 any range or value therein, more recently discovered G- protein 
receptors are as follows: 



ITO- (136) YAI FVLYASAWLS FLNCPFIS ILHLCVIALVTLNI (SEQ ID NO:233) 
■TK3- (139) YAIFVLYATAWLSFLNCPFISILNLCAIALDVTLNI (SEQ ID NO: 234) 
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TM3-C.40) YAIFVLYATAWLTFLNCPFISIFVELGALVTLNI (SEQ ID NO: 235) 

TM3-(141) YAIFVLYASAWIiTFIjNCPFISIFVELSIMHLCAISLGALVTLKI (SEQ ID NO:236) 

TM3-(142) WAIFVLYAILGRWEFQIHLCKLWLTSAWLSIMKLCAISLSFLKCPFIVTLNI (SEQ ID NO:237) 

TM3-(143) WAIFVLYAIIfiHWEFGIHLCKLWLTTRWLSIMHLCAISLSFLKCPFIVTLNI (SEQ ID 110:238) 

5 TM3-CL44) WAIFVLYATAWLTFLNCPFSIMHLCAISLIVTLNKSEQ ID NO:239) 

TM3-(145) WAIFVLYASAWLTFLNCPFISIMHLCAISLVTLNI (SEQ ID NO: 240) 

TM3-(146) YAVFVLYASAWLSFLNMSIMHLCAISLPFIVTLNI (SEQ ID NO:241) 

TM3-U47) YAVFVLYATAWLSFLNMPFSILSLCAIftLDIVTLNI(SEQ ID NO: 242) 

TM3-(148) YAWVLYATAWLSILNLCAIALDTFLNMPFIVTLNI (SEQ ID NO:243) 

10 TM3-(149) YAVFVLYAESILNIXIAIALDSAWLTFLNMPFIVTLNI (SEQ ID NO: 244) 

TM3-(150) YAIFVLYASAMLSFIMCVTASIPFCLVSIFVELGALIVTLNKSEQ ID N0:245) 

TH3-U51) YAIFVLYASAWI^FI«CLVSIFVEI^ALIVVTASIVTLNI(SEQ ID NO: 246) 

TM3-(1S2> YAIFVLYASAWLSFLNVTLNCLVSIFVELGM.il (SEQ ID NO: 24 7) 

TM3-(1S3> YAIFVLYASAWLSFIiNTASILNLMFICLVSIFVELGALVTLNI (SEQ ID NO: 246) 

15 TM3-(154> YAIFVLYASAWLSFINMASILNLPFCLVSIFVELGALVTLNI (SEQ ID NO: 249) 

TM3- (155) YAIFVLYASAWLSFLNIIiGroffiFGIKIiCKLWLTC^VIiCCTSSGILIiIiAPFIVTIjNI (SEQ ID NO:250) 

TM3- (156) YAIFVLYASAWLSFIiNMIIjGRWEPGIHLCKLWLTCDVIXrCTSSGILIilJ^FIVTL^ (SEQ IDNO: 251) 

TM3- (157) YAIFVLYASAVttlliGRMEFGIHLCKLWLTCDVLCCTSSFLNSELSVYTLTVCPFIVTLNI (SEQ ID 
NO:252) 

20 TM3-U58) YAIFVLYAIIiGRHEFGIHLCKLWLTCDVLCCTSSAHLSFIiNHSELSVYTLTVPFIVTXiNI (SEQ ID 
NO: 253) 

TH3-(159) YAIFVLYASAWLASRWPLPLSVYTLTVSFLNCPFIVTLNI (SEQ ID NO: 254) 

TM3- (160) YAIFVLYASAWLASELILYYHSWPLPCLHDLVHLCTCS ILHLCVIALSV iTLTVPFIVTLNI ( SEQ ID 
NO: 255) 

25 TM3- (161) YAIFVLYASAVfLSFLASELSVYASELSSTLHDIAn^WLDWCVIALTTVNMPFI ( SEQ ID 

NO: 256) 

TK3 - ( 162 ) YAI FVLYASAWLSFLNGGEIALWSLCPFI ILYYWRWPLPCLHDLVS ILHLCVIALVTLMI (SEQ ID 
NO:257) 

TM3- (163) YVWLV^WCCTCSIIJILCVIALFVLYASAWLSFLNGGSIALWSLIVTLNI ( SEQ ID NO; 258) 

3 0 TIC- (164) YAIFVLYASAWLAIILYYWRWPLPCLHDWKEIALWSIiNCPFIVTLNI (SEQ ID NO: 259) 



Non- limiting examples of consensus GPR polypeptides for 
domain V across several or many, such as 1-500, or any range or value 
therein, G-protein receptors are as follows: 



TM5-U) CDVFVFVDIMLCTASIFNLCAISVG (SEQ ID NO:260) 

TM5-{2) YAIFVLYDIMLCTASIFNLCAISVG (SEQ ID NO:261) 

TM5-(3) DYAIFVFVDIMLMTASIFNLMAISVGtSEQ ID NO:262) 

TM5-(4) D YAI FVFVD IMLKTTASTI FNLMATITVG (SEQ ID NO:263) 

TM5-(5) CDVAWYSSDIMLFYVCTASIFSSNLCAISSVG (SEQ ID NO: 264) 

TUB- (6) FLFCSLGSFYIPIAVILVDIMLCTASIFNLCAISVG (SEQ ID NO:.265) 

TM5-(7) YAI FVLYDFLFCSLGSFYIPIAVILIMLCTASI FNLCAI SVG ( SEQ ID NO:266) 

TM5-(8) DYAIFVFVDIMLMTASIFLFCSLGSFYIPIAVILISVG (SEQ ID NO: 267) 

TM5-(9) DYAIFVFVDIMIiHTTASTIFNLMAFLFCSLGSFYIPIAVTLTITVG (SEQ ID NO:268) 
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TM5-U0) CDVRWYSSDIMLFYVCTASIPSSNLFLFCSLGSFYCA1SSVG(SRQ ID NO:269) 
TM5- (11) CDVFVFVDIMLCTASIFNWyiLSSIGSFFAPCLILLVYLLCAISVG(SEQ ID NO:270) 
TM5-<12) YAlEVLTOIMLCTASIFNLCAIffYILSSIGSFFAPCLILLVTLSVGISEQ ID HO:271) 
TM5-(13) DYAIFVFVDIWYILSSIGSFFAPCLILLVYIAS1FNLMMSVG(SEQ ID NO:272) 
5 TM5-<14) DrAIWYILSSIGSFFAPCLILLVYLIMLHTTASTIFNLMATITVG(SEQ ID HO:273) 

TIE- (15) CDVAWYS SDIHLFYVCWYILSS IGS FFAPCLILLVYLSSNLCAISSVG ( SEQ ID NO:274) 
TM5- (16) CDVFVFVDIKLCTASIFWYVISSSIGSFFAPCLIHHLVY1ILCRISVQ (SEQ ID 110:275) 
TM5- (17) YAIFVLYDimjCTASIFHLCAIWYVlSSSIGSFFAPCLINHLVySVG(SEQ ID 190:2 7 6) 
TM5-{18) DYAIFVFVWYVTSSSIGSFFAPCLINHLVYDIMIjOTASIFNLMArSTC (SEQ ID 110:277) 
10 IMS- (19) DYAIFVFVDIMLHTTASTIFMYVISSSIGSFFAPCLIHHLVYTVG(SEQ ID N0:278) 

TM5- (20) CDVAVVYSSDIMLFYVCTASIFSWYVISIGSFFAINHLVYNLCAISSVG(SEQ ID N0:279) 
TM5-<21) CDVFVFUDIMLCTASIFNLCAITYAISSSVISFYIPVAILVTYT (SEQ Tu NO:280> 
TK5-(22) YAIFVLYDIKLCTATYAISSSVISFYIPVAILVTYTSIFNLCAISVG (SEQ ID NO: 281) 
TM5-(23) DYAIFVFVDIMLMTATYAISSSVISFYIPVAILVTYTISVG (SEQ ID HO: 2 82) 
15 TM5-<24) TYAISSSVISFYIPVATTDYAIFVFVDimjnTASTIFNLMATlrVG (SEQ ID NO:283> 
TM5-(25) CDVAWYSSDIMLFYVCTATYAISSSVISFYIPVAILVTYTSSVG(SEQ ID NO:284) 
TMS-(26) CDVFVFVDFVIYSSVVSFYLPFGVTVLVYACTASIFNLCAISVG ( SEQ ID NO:285) 
TM5-{27) YAIFVLYDFVIYSSVVSFYI,PFGVTVLVYASIFNLCAISVG(SEQ ID NO:286) 
TM5-{28) DYAIFVFVDFVIYSSWSFYLPFGVTVliVYATASIFNLMAISVG (SEQ ID NO:287) 
20 TM5-(29) DYAIFVFVDFVIYSSWSFYLPFGVTVLVYAHTTASTIFNLMATITVG (SEQ ID 390:288) 
TO!5-(30) CDVAVVYSSDFVIYSSVVSFYIJFGVTVYVCTASIFSSNLCAISSVG(SEQ ID N0:289> 
TM5 - { 3 1 ) CDVFVFVDIMLCTAS YTIYSTCGAFYI PSVLLI ILYGNLCAI SVG (SEQ ID HO: 290) 
TM5-(32) YAIFVLYDIMLCEfiSYTIYSTCGRFYIPSVLLIILYGNLCAISVG (SEQ ID NO:291) 
TM5-<33) DYAIFVFVDIMLMTASYTIYSTCGAFYIPSVLLIILYGHIiClISVG (SEQ ID N0:292) 
25 TM5-(34) DYAIFVFVDIMUaTTASYTIYSTtXBlFYIPSVLLIILYGMA.TITVG (SEQ ID HO: 293) 

TOE- (35) CDVAWYSSDIMSYTIYSTCGAFYIPSYLLIILYGIFSSHLCAISSVG(SEQ ID NO: 294) 
TM5 -(36) CDVFVFFVLIGSFVAVDIIILCTASIFHLCAISVG ( SEQ ID NO: 295) 
TM5 - ( 3 7 ) YAIFVLYFVLIGSFVADIMLCTASIFNLCAISVG {SEQ ID HO : 295) 
TOG- (38) DYAIFVFVFVLIGSFVADIMLlCTASIFNIiMAISVG (SEQ ID NO:297) 
30 TM5-(39) DYAIFVFVFVLIGSFVADIMLHTTASTIFHIJIATITVG (SEQ ID HO:298) 

TM5-<40) CDVAVVYSSFVLIGSFVADniLFYVCTASIFSSHI»CAISSVG (SEQ ID NO: 299) 
TM5-(41> CDVFVFVDIMLCFFIPTLIMVITYFNIiCAISVG (SEQ ID HO: 3 00) 
TMS-(42) YAIFVLYDIMLCFFIPTLIMVITYFFNLCAISVG (SEQ ID NO: 301) 
TM5-(43) DYAIFVFVDIMLMFFIPTLIMVITYFNLMAI SVG (SEQ ID H0:302) 
35 TM5-(44) DYAIFVFVDIMUOTFFIPTLIMVTTYFNLMATITVG (SEQ ID HO: 303) 

TM5- (45) CDVAWYSSDIMLFYVCFFIPTLIMVITYFSSNLCAISSVG(SEQ ID NO: 304) 
TH5- (46) CDVVYGLVDGLVTFYLPLLIlffilTYYDIMLCTASIFNLCAISVG (SEQ ID NO: 305) 
TIG- (47) YAIVYGLVDGLVTFYLPLIJirciTYYDIHLCTASIFNLCAISVG (SEQ ID NO: 30 6) 
TM5-{48) DYAIVYGLVDGLVTFYLPLLIHCITYYDIMUTrASIFNlMAISVGtSEQ ID NO: 307) 
40 TM5-{49) DYftlVyGLVDGLVTFYLPLLIMCISSDIMljHTTASTIFNLMATITVG { SEQ ID NO : 3 0 8 ) 
TM5-(50) CDWYDGLVTFYLPLLIMC ITYYD IMLFYVCTAS I FSSNLCAI SS VG ( S EO ID NO:309) 
TM5-(51) CDVFVI^I1^VIFLGLVIVIPFVLIIVSYASIFHLCAISVG{SEQ ID NO:310) 
TM5-(52) YAIFVLYDIM^VIFLGLVIVIPFVLirvSYAIFHLCAISVG(SEQ ID NO:311) 
TM5 - ( 53 ) DYAIFVI^IMIJittVIFLGLVIVIPFVLIIVSYAIFNLMAISVG (SEQ ID NO: 3 12) 
45 TM5-I54) DYAIFVFVDIMLHTLVIFLGLVIVIPFVLIIVSYAIFNIjMATITVG (SEQ ID NO: 3 13) 
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TM5-(55) CDVAVVYSSDIMLFLVIFLGLVIVIPFVLIIVSyAIFSSNLCAISSVGIS^Q ID NO:314) 
TM5-<56> CDVFVFVDIMIiCTALMI YILGGLI III PFLLI VMSYVS IFNLCAI SVG ( SEQ ID NO:315) 
TMS- (57) YAIFVLYDIMLCTALMIYILGGLI 1 1 I PFLLI VMSYVS I FNLCAI SVG f SEQ ID NO: 316) 
TM5-(58) DYAIFWVDnn^ASIFm2aYII^LIIIIPFIiIVI^yVLKAISVG(SEQ ID NO: 317) 
TM5-(59) DYAIFVFVDIMLHTTASTILMIYILGGLIIIIPFLLIVMSYVTTVG (SEQ ID HO : 31B) 
TM5-(60) CDVAVVYSSDIKLFTVCTAYlI^LIPFIilVmTVSIFTHLCAISSVGtSEQ ID NO:319) 
TIB - ( 6 1 ) CDVFVFVDIMLCTAS I FNLIJfflHIMBVI 1 1 VI PFVLIVISYACAISVG (SEQ ID NO; 320) 
TM5-<e2) YAIFVLYDimCTASIFKLLMIHIMEVIIIVIPFULIVISYACAISVG (SEQ ID 110:321) 
TM5-(S3) DYAIFVI^IHUniASIFIMIHllflEVIIIVIPFVLIVISYAISVG(SEQ ID H0:322) 
TM5-(64) DYAIPVFVDEMLKTTASTIIJIIKIMEVIIIVIPFVLrviSYAITTG (SEQ ID NO:323) 
TM5-(S5) CDVAVVYSSDIMLFYVCTASIFLMIHIMEVIIIVIPFVLIVISYAAISSVGfSEQ ID NO:324) 

Non-limiting examples of longer consensus GPR polypeptides 
for domain V across several or many, such as 1-500, or any value or 
range therein, G-protein receptors are as follows: 



TMlNWPALSIVVIIimiGGNILVin^VSIYTSLDVm^ 
IGYVCSSSLGINPVIIYTLF(SEQ ID NO: 325) 

T M i . < 2 

NW^SIWIIINTIGGHILVIMAVTIYTT^ 
VCTTTLGINPVIIYTLF(SEQ ID NO: 32 6) 

T M ! (3 

ITOPALTIVVIIINTIGGNILVIMAVSIYTmjVHLCTATILNIilTLFVLIGTF^^ 
VCSTSLGINPVIIYTLF (SEQ ID NO:327) 

T M 1 ( 5 

HWPALTIWIIINTIGGNILVIMAVTIYTCIB^ 
VCTIiGINPVIIYTLF (SEQ ID NO: 328) 

T " 1 - (6 

NWKNWSALLTTVVIILTIAGNILVIMAVS^ 
VCSSSLGINPVIIYTLF (SEQ ID NO: 329) 

T M 1 17 

ITITVVIAVLIIJITTOGNVWCIAVGSIYTSIJJVMLCTASILNLLISLFV^ 
YVCSSSLGINPVIIYTLFISEQ ID NO:330) 

T M 1 - (8 

TLTLVCIACLUSLTVFGNVLVI IAVFSLDVMLCTAS ILNLLISLFVLIGS FVAFFIPLTIKVITYFLFNVFFVWIGYVCS 
SSLGINPVIIYTLF (SEQ ID NO: 331) 

T H 1 ( 9 . 

TAAIAAAITFLILFTIFGNALVIIAVLSIYTSLDVMLCTASILNIiLISLFVLIGSFVAFFIPLTIMVIT^ 
GYVCSSSLGINPVIIYTLF(SEQ ID NO:332) 

T M 1 " < 1 0 

AISVGLVI^ILFAIVGNILVILSVANWPAI^rVWIINTIGGNILVIMAVSIYTSLDVm 
FVAFFIPLTIMVITYFLFNVFFVWIGYVCSSSLGINPVIIYTLF (SEQ ID NO: 333) 
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T K . 1 - ( ! 1 , 

AMiAGALliALAVLATVGGNLLVIVAIASLDVMLCTAS ILNLLI SLFVLIGSFVAFFIPLTIMVITYFLFNVFFVWIGYVC 
SSSLGINPVTIYTLF (SEQ ID NO: 334) 

T M 1 (12) 

TAGDCLIMLIVLLIVAGNVLVIVMSMVML^ 

SLGINPVIIYTLF<SEQ ID NO: 335) 

T M 1 - <13) 

VITIAWTAWSLariVGNVLVMI SFS ITTSLDVMLCEAS ILNLLI SLFVLIGSFVAFFIPLTIMVTTYFLFNVFPWIG 
YVCSSSLGINPVirYTLF(SEQ ID NO: 33 6) 

T H 1 (14) 

MVFIATVRGSLSLVTWGNILVMLSISIOTSIJJVMLCTASIL^ 

YVCSSSLGINPVTIYTLFiSEQ ID NO: 337) 

T M 1 (15) 

WFLAFLTGIIALVTIIGNILVIVSFSIYTS1I)VMLCTASILNLLIS^ 

VCSSSLGINPVIIYTLF(SEQ ID 190:338) 



Non- limiting examples of longer consensus GPR polypeptides 
for domain V across several or many, such as 1-500, or any value or 
range therein, G-protein receptors are as follows: 



NWPALS IWI I INTIGGNILVTMAFFACFVLVLTQSSIFSLLAI AINLLISLFVLIGS FVAFFIPLTIMVITYFLFNVFF 
VWIGYVCSSSlCINPVIIVTLFtSEQ ID NO: 3 39) 

T M 3 - (15 6) 

NWPALSI WI I INTIGGNILVIMAFFACFVLVLTQSSI FSLLA1AI FVLIGSFVAFFIPLTIMVITYFLFHVFFVWIGYV 
CSSSLGINPVIIYTLF(SEQ ID NO:340) 

T M 3 (15 7) 

NWPALS IWI I INTIGGNILVIHAVMVACPVLILTQSS IIALLAIAVS FVAFFIFLTIMVITYFLFNVFFVWIGYVCSSS 
LGINPVIIYTLF(SEQ ID NO: 341) 

T M 3 (16 8) 

NWPAI^IVVIIIHTIGGNILVIMAVLWLAI^YVASNASVLNLL^ 

PVIIYTLF(SEQ ID NO: 342) 

T M 3 - (15 9) 

NWPALSI WI I INTIGGNILVIMAVLYWSNASVMNLL^ ^ 
IIYTLF (SEQ ID NO: 343) 

T M 3 - ( i 7 0 ) 

NWPALS IWI I INTIGGNILVIMAVLWIAIDYVASNAS VLNLLVI SFGSFVAFFIPLTIMVITYFLKNVFFVWIGYVCSS 
SLGINPVIIYTLF (SEQ ID NO: 344) 

T M 3 (17 1) 

NWPALS IVVI IIKTIGGNILVIMAVLFPFLQKS SVGITVTiNLCALSGSFVAFFIPLTIMVITYFLFNVFFVWIGYVCSSS 
LGINPV3IYTLF {SEQ ID NO:34S) 

T M 3 - (17 2) 

NWPALSIVVIIINTIGGNILVlMaVCITYLQYI^INASSCSITAFTIIGSFVAFFIPLTIMVI^ 
SSLGINPVIIYTLF (SEQ ID NO: 346) 
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T M 3 (17 3) 

NWPAlSIWIIIOTIGGNILVIMAVFHNFFPljyUjF^ 

LGINPVIIYTLF (SEQ ID NO: 347) 

T M 3 (17 4) 

IWPALSIVVIIINTIGGNILVIMAVIASASVSFNLYAS^ 

SLGISPVIIYTLF(SEQ ID NO: 34 8) 

As another non- limiting, illustrative example of a GPR 
polypeptide consensus sequences across each individual or different 
transmembrane domains of 5-HT receptors may be made, such as for 5- 
HT, as the following: 

5HT consensus (4 ) KNASALLS VI I INS IGGNWTAVS (SEQ ID NO: 349); 
5HT consensus (5) YFLMSIAVTDLWSFVMPVSAL (SEQ ID NO: 350); 
5HT consensus (6) AITKIAITWAISGVSVPFIPVWG (SEQ ID NO: 351) ; and 
5HT consensus (7) LGIIFGTFIIIWLPFFITNLVSPI {SEQ ID NO: 352); 

Wherein variations and substitutions of amino acids may be made as 
described herein. 

Alternatively, S-HT consensus sequences may be 
provided as consensus peptides of the present invention as consensus 
peptides for individual transmembrane domains, such as 5-HT domains 
III, V and VII, e.g., as follows: 

5-HT consensus (8) : IWISLDVLFSTASSIMHLCAISL (SEQ ID ND:353) 
5-HT consensus (3) : GYTIYSTr.VTFYIPSVIMVTTYG (SEQ ID N0:354j 
5-HT consensus (10) : LLNFFNWIGYLNSLINPVlyTLF (SEQ ID NO: 355) 

This invention is also directed to an antibody which binds 
an epitope specific for a GPR polypeptide of the present invention 
and the use of such an antibody to detect the presence of , or measure 
the quantity or concentration of, the GPR protein in a cell, a cell 
or tissue extract, a biological fluid, an extract thereof, a 
solution, or sample, in vitro, in situ, or in vivo. 
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The term "antibody" is meant to include polyclonal 
antibodies, monoclonal antibodies (mAbs) , chimeric antibodies, anti- 
idiotype {anti-Id} antibodies to antibodies specific for GPR 
polypeptide of the present invention, as well as fragments, consensus 
5 polypeptides or chemical derivatives thereof. 

Polyclonal antibodies are heterogeneous populations of 
antibody molecules derived from the sera of animals immunized with 
an antigen. 

A monoclonal antibody contains a substantially homogeneous 
10 population of antibodies specific to antigens, which population 
contains substantially similar epitope binding sites. MAbs may be 
obtained by methods known to those skilled in the art. See, for 
example Kohler and Milstein, Mature 256:495-497 (1975); U.S. Patent 
No. 4,376,110; Ausubel et al, eds., Current Protocols in Moleculax 
15 Biology, Wiley Interscience, N.Y. , (1987, 1992); and Harlow and Lane 
Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory (1988) , 
the contents of which references are incoporated entirely herein by 
reference. Such antibodies may be of any immunoglobulin class 
including IgG, igM, IgE, IgA, GILD and any subclass thereof. A 
20 hybridoma producing a mAb of the present invention may be cultivated 
in vitro, in situ or in vivo. Production of high titers of mAbs in 
vivo or in situ makes this the presently preferred method of 
production. 

Chimeric antibodies are molecules different portions of 
25 which are derived from different animal species, such as those having 
variable region derived from a murine mAb and a human immunoglobulin 
constant region, which are primarily used to reduce immunogenicity 
in application and to increase yields in production, for example, 
where murine mAbs have higher yields from hybridomas but higher 
30 immunogenicity in humans, such that human/murine chimeric mAbs are 
used. Chimeric antibodies and methods for their production are known 
in the art (Cabilly et al, Proc. Natl. Acad. Sci. USA 81:3273-3277 
(1984); Morrison et al . , Proc. Natl. Acad. Sci. USA 81:6851-6855 
(1984); Boulianne et al., Nature 312:643-646 (1984); Cabilly et al . , 
35 European Patent Application 125023 (published November 14, 1984); 
Neuberger et al., Mature 314:268-270 (1985); Taniguchi et al., 
European Patent Application 171496 (published February 19, 1985); 
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Morrison et al., European Patent Application 173494 {published March 
5, 1986); Neuberger et al . , PCT Application WO 86/01533, (published 
March 13, 1986); Kudo et al . , European Patent Application 184187 
(published June il, 1986); Morrison et al., European Patent 
5 Application 173494 (published March 5, 1986); Sahagan et al . , J. 
Immunol. 137:1066-1074 (1986); Robinson et al . , International Patent 
Publication No. PCT/VS86/ 02269 (published 7 May 1987); Liu et al., 
Proc. Natl. Acad. Sci. USA 84:3439-3443 (1987); Sun et al . , Proc. 
Natl. Acad. Sci. USA 84:214-218 (1987); Better et al., Science 

10 240:1041- 1043 (1988); and Harlow and Lane Antibodies: A Laboratory 
Manual Cold Spring Harbor Laboratory (1988)}. These references are 
incorporated entirely herein by reference. 

An ant i- idiotypic (anti-Id) antibody is an antibody which 
recognizes unique determinants generally associated with the antigen- 

15 binding site of an antibody. An Id antibody can be prepared by 
immunizing an aniinal of the same species and genetic type (e.g., 
mouse strain) as the source of the mAb with the mAb to which an anti- 
Id is being prepared. The immunized animal will recognize and 
respond to the idiotypic determinants of the immunizing antibody by 

20 producing an antibody to these idiotypic determinants (the anti-Id 
antibody). See, for example, U.S. patent No. 4,699,880, which is 
herein entirely incorporated by reference. 

The ant i- Id antibody may also be used as an "immunogen" to 
induce an immune response in yet another animal, producing a so- 

25 called anti-anti-Id antibody. The anti-anti-Id may be epitopically 
identical to the original mAb which induced the anti-Id. Thus, by 
using antibodies to the idiotypic determinants of a mAb, it is 
possible to identify other clones expressing antibodies of identical 
specificity. 

30 Accordingly, mAbs generated against a GPR polypeptide of 

the present invention may be used to induce anti-Id antibodies in 
suitable animals, such as BALB/c mice. Spleen cells from such 
immunized mice are used to produce anti-Id hybridomas secreting anti- 
Id mAbs. Further, the anti-Id mAbs can be coupled to a immunogenic 

35 carrier such as keyhole limpet hemocyanin (KLH) or cationized bovine 
serum albumin and used to immunize additional BALB/c mice. Sera from 
these mice will contain anti-anti-Id antibodies that have the binding 
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properties of the original mftb specific for a GPR polypeptide 
epitope . 

The anti-id mAbs thus have their own idiotypic epitopes, or 
"idiotopee" structurally similar to the epitope being evaluated. 
5 The term "antibody" is also meant to include both intact 

molecules as well as fragments thereof, such as, for example, Fab and 
F(ab') 2 , which are capable of binding antigen. Fab and F(ab') 2 
fragments lack the Fc fragment of intact antibody, clear more rapidly 
from the circulation, and may have less non-specific tissue binding 
10 than an intact antibody (Wahl et al., J. Nucl . Med. 24:316-325 
(1983) ) . 

It will be appreciated that Fab and F(ab'), and other 
fragments of the antibodies useful in the present invention may be 
used for the detection and quantitation of a GPR polypeptide 

15 according to the methods disclosed herein for intact antibody 
molecules. Such fragments are typically produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or 
pepsin (to produce F(ab') 2 fragments). 

An antibody is said to be "capable of binding" a molecule 

20 if it is capable of specifically reacting with the molecule to 
thereby bind the molecule to the antibody. The term "epitope" is 
meant to refer to that portion of any molecule capable of being bound 
by an antibody which can also be recognized by that antibody. 
Epitopes or "antigenic determinants" usually consist of chemically 

25 active surface groupings of molecules such as amino acids, lipids or 
sugar side chains and have specific three dimensional structural 
characteristics as well as specific charge characteristics. 

An "antigen" is a molecule or a portion of a molecule 
capable of being bound by an antibody which is additionally capable 

30 of inducing an animal to produce antibody capable of binding to an 
epitope of that antigen. An antigen may have one, or more than one 
epitope. The specific reaction referred to above is meant to 
indicate that the antigen will react, in a highly selective manner, 
with its corresponding antibody and not with the multitude of other 

35 antibodies which may be evoked by other antigens. 

The antibodies, or fragments of antibodies, useful in the 
present invention may be used to quantitatively or qualitatively 
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detect a GPR polypeptide in a sample or to detect presence of cells 
which express a GPR polypeptide of the present invention. This can 
be accomplished by inununof luorescence techniques employing a 
fluorescently labeled antibody (see below) coupled with light 
5 microscopic, flow cytometric, or fluorometric detection. 

The antibodies (of fragments thereof) useful in the present 
invention may be employed histologically, aB in immunofluorescence 
or immunoelectron microscopy, for in situ detection of a GPR 
polypeptide of the present invention. In situ detection may be 

10 accomplished by removing a histological specimen from a patient, and 
providing the a labeled antibody of the present invention to such a 
specimen. The antibody (or fragment) is preferably provided by 
applying or by overlaying the labeled antibody (or fragment) to a 
biological sample. Through the use of such a procedure, it is 

15 possible to determine not only the presence of a GPR polypeptide but 
also its distribution on the examined tissue. Using the present 
invention, those of ordinary skill will readily perceive that any of 
wide variety of histological methods {such as staining procedures) 
can be modified in order to achieve such in situ detection. 

20 Such assays for a GPR polypeptide of the present invention 

typically comprise incubating a biological sample, such as a 
biological fluid, a tissue extract, freshly harvested cells such as 
lymphocytes or leukocytes, or cells which have been incubated in 
tissue culture, in the presence of a detectably labeled antibody 

25 capable of identifying a GPR polypeptide, and detecting the antibody 
by any of a number of techniques well-known in the art. See, e.g., 
Harlow and Lane, supra.; Ausubel et al, supra ; and Sambrook et al, 
supra . 

The biological sample may be treated with a solid phase 
30 support or carrier, such as nitrocellulose, or other solid support 
or carrier which is capable of immobilizing cells, cell particles or 
soluble proteins. The support or carrier may then be washed with 
suitable buffers, followed by treatment with a detectably labeled GPR 
polypeptide- specific antibody. The solid phase support or carrier 
35 may then be washed with the buffer a second time to remove unbound 
antibody. The amount of bound label on said solid support or carrier 
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may then be detected by known method steps, see, e.g., Harlow, supra : 
Ausubel, supra : or Sambrook, supra . 

By "solid phase support", "solid phase carrier", "solid 
support ", "solid carrier", "support" or "carrier" is intended any 
5 support or carrier capable of binding antigen or antibodies. Well- 
known supports or carriers, include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon amylases, natural and 
modified celluloses, polyacrylamides , gabbros, and magnetite. The 
nature of the carrier can be either soluble to some extent or 

10 insoluble for the purposes of the present invention. The support 
material may have virtually any possible structural configuration so 
long as the coupled molecule is capable of binding to an antigen or 
antibody. Thus, the support or carrrier configuration may be 
spherical, as in a bead, or cylindrical, as in the inside surface of 

15 a test tube, or the external surface of a rod. Alternatively, the 
surface may be flat such as a sheet, polymer test atrip, etc. 
Preferred supports or carriers include polystyrene beads. Those 
skilled in the art will know many other suitable carriers for binding 
antibody or antigen, or will be able to ascertain the same by use of 

20 routine experimentation. 

The binding activity of a given lot of anti-GPR polypeptide 
antibody may be determined according to well known method steps. 
Those skilled in the art will be able to determine operative and 
optimal assay conditions for each determination by employing routine 

25 experimentation. See, e.g., Harlow, supra . 

Other such steps as washing, stirring, shaking, filtering 
and the like may be added to the assays as is customary or necessary 
. for the particular situation. 

One of the ways in which a GPR polypeptide- specif ic 

30 antibody, anti-idiotype antibody or fragment thereof, can be 
detectably labeled is by linking the same to an enzyme and use in an 
enzyme immunoassay (EIA) . This enzyme, in turn, when later exposed 
to an appropriate substrate, will react with the substrate in such 
a manner as to produce a chemical moiety which can be detected, for 

35 example, by spectrophotometric , fluorometric or by visual means. 
Enzymes which can be used detectably label the antibody include, but 
are not limited to, malate dehydrogenase, staphylococcal nuclease, 
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delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha- 
glycerophosphate dehydrogenase, triose phosphate isomerase, 
' horseradish peroxidase, alkaline phosphatase, asparaginase, glucose 
oxidase, beta-galactosidase , ribonuclease, urease, catalase, glucose - 
5 6- phosphate dehydrogenase, glucoamylase and acetylcholinesterase. 
The detection can be accomplished by colorimetric methods which 
employ a chromogenic substrate for the enzyme. Detection may also 
be accomplished by visual comparison of the extent of enzymatic 
reaction of a substrate in comparison with similarly prepared 

10 standards. See, Harlow, supra . Ausubel, supra . 

Detection may be accomplished using any of a variety of 
other immunoassays. For example, by radioactivity labeling the 
antibodies or antibody fragments, it is possible to detect R-PTPase 
through the use of a radioimmunoassay (RIA) . A good description of 

15 RIA maybe found in L aboratory Techniques and Biochemistry in 
Molecular Biology, by Work et al., North Holland Publishing Company, 
NY (1978) with particular reference to the chapter entitled "An 
Introduction to Radioiiamune Assay and Related Techniques" by Chard, 
incorporated entirely by reference herein. The radioactive isotope 

20 can be detected by such means as the use of a 7- counter, a 
scintillation counter or by autoradiography. 

It is also possible to label an anti-GPR polypeptide 
antibody, anti-idiotype antibody or fragment thereof, with a 
fluorescent compound. When the f luorescently labeled antibody is 

25 exposed to light of the proper wave length, its presence can be then 
be detected due to fluorescence. Among the most commonly used 
fluorescent labelling compounds are fluorescein isothiocyanate, 
rhodamine, phycoerythrin, phycocyanin, allophycocyanin, 
o-phthaldehyde and f luorescamine , commercially avai lable, e.g., from 

30 Molecular Probes, Inc. (Eugene, Ore.). 

The antibody can also be detectably labeled using 
fluorescence emitting metals such as li2 EU, or others of the 
lanthanide series. These metals can be attached to the antibody 
using such metal chelating groups as diethyl enetriamine pentaacetic 

35 acid (EDTA) . 

The antibody also can be detectably labeled by coupling it 
to a chemi luminescent compound. The presence of the 
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chemilumines cent -tagged antibody is then determined by detecting the 
presence of luminescence that arises during the course of a chemical 
reaction. Examples of particularly useful chemiluminescent labeling 
compounds are luminol, isoluminol, theromatic acridinium ester, 
5 imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label 
the antibody of the present invention. Bioluminescence is a type of 
chemilumines cence found in biological systems in which a catalytic 
protein increases the efficiency of the chemiluminescent reaction. 

10 The presence of a biolundnescent protein is determined by detecting 
the presence of luminescence. Important biolundnescent compounds 
for purposes of labeling are luciferin, lucif erase and aequorin. 

An antibody molecule of the present invention may be 
adapted for utilization in a immunometric assay, also known as a 

15 "two-site" or "sandwich" assay. In a typical immunometric assay, a 
quantity of unlabeled antibody (or fragment of antibody} is bound to 
a solid support or carrier and a quantity of detectably labeled 
soluble antibody is added to permit detection and/or quantitation of 
the ternary complex formed between solid-phase antibody, antigen, and 

20 labeled antibody. 

Typical, and preferred, immunometric assays include 
"forward" assays in which the antibody bound to the solid phase is 
first contacted with the sample being tested to extract the antigen 
form the sample by formation of a binary solid phase antibody -antigen 

25 complex. After a suitable incubation period, the solid support or 
carrier is washed to remove the residue of the fluid sample, 
including unreacted antigen, if any, and then contacted with the 
solution containing an unknown quantity of labeled antibody (which 
functions as a "reporter molecule"). After a second incubation 

30 period to permit the labeled antibody to complex with the antigen 
bound to the solid support or carrier through the unlabeled antibody, 
the solid support or carrier is washed a second time to remove the 
unreacted labeled antibody. 

In another type of "sandwich" assay, which may also be 

35 useful with the antigens of the present invention, the so-called 
"simultaneous" and "reverse" assays are used. A "simultaneous" and 
"reverse" assays are used. A simultaneous assay involves a single 
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incubation step as the antibody bound to the solid support or carrier 
and labeled antibody are both added to the sample being tested at the 
same time. After the incubation is completed, the solid support or 
carrier is washed to remove the residue of fluid sample and 
5 uncomplexed labeled antibody. The presence of labeled antibody 
associated with the solid support or carrier is then determined as 
it would be in a conventional "forward" sandwich assay. 

In the "reverse" assay, stepwise addition first of a 
solution of labeled antibody to the fluid sample followed by the 
i addition of unlabeled antibody bound to a solid support or carrier 
after a suitable incubation period is utilized. After a second 
incubation, the solid phase is washed in conventional fashion to free 
it of the residue of the sample being tested and the solution of 
unreacted labeled antibody. The determination of labeled antibody 
associated with a solid support or carrier is then determined as in 
the "simultaneous" and "forward" assays. See, e.g., for the above- 
mentioned immunological techniques, Harlow, supra : Ausubel et al, 
supra; and Sambrook et al, supra. GPR polypeptides of the present 
invention can be made by chemical synthesis or by recombinant 
methodB, wherein chemical synthesis is preferred. 

Synthetic , production of transmembrane p r- oteinR of t->,o 
present invent inn 

GPR polypeptides , variants and chemical derivatives thereof 
can be synthesized according to known method steps, including 
portions of known GPR transmembrane domains, consensus peptides 
thereof, conservative substitution derivative thereof or functional 
derivatives thereof. 

Chemical polypeptide synthesis is a rapidly evolving area 
in the art, and methods of solid phase polypeptide synthesis are 
well-described in the following references, hereby entirely 
incorporated by reference: (Merrifield, B. , j. Araer. chem. Soc. 
85:2149-2154 (1963); Merrifield, B . , Science 232:341-347 (1986); 
Wade, J.D. et al., Biopolymera 25:S2l-S37 (1986); Fields, G.B., Int. 
J. Polypeptide Prot. Res. 35:161 (1990); MilliGen Report Nos. 2 and 
2a, Millipore Corporation, Bedford, MA, 1987) Ausubel et al, supra . 
and Sambrook et al . supra . ~~ * 
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In general, as is known in the art, such methods involve 
blocking or protecting reactive functional groups, such as free 
amino, carboxyl and thio groups. After polypeptide bond formation, 
the protective groups are removed (or de-protected). Thus, the 
5 addition of each amino acid residue requires several reaction steps 
for protecting and deprotecting. Current methods utilize solid phase 
synthesis, wherein the C- terminal amino acid is covalently linked to 
an insoluble resin particle large enough to be separated from the 
fluid phase by filtration. Thus, reactants are removed by washing 

10 the resin particles with appropriate solvents using an automated 
programmed machine The completed polypeptide chain is cleaved from 
the resin by a reaction which does not affect polypeptide bonds. 

In the more classical method, known as the "tBoc method," 
the amino group of the amino acid being added to the resin-bound 

15 C-terminal amino acid is blocked with tert-butyloxycarbonyl chloride 
(tBoc) . This protected amino acid is reacted with the bound amino 
acid in the presence of the condensing agent 
dicyclohexylcarbodiimide, allowing its carboxyl group to form a 
polypeptide bond the free amino group of the bound amino acid. The 

20 amino -blocking group is then removed by acidification with 
trifluoroacetic acid (TFA) ; it subsequently decomposes into gaseous 
carbon dioxide and isobutylene. These steps are repeated cyclically 
for each additional amino acid residue. A more vigorous treatment 
with hydrogen fluoride (HP) or trif luoromethanesulf onyl derivatives 

25 is common at the end of the synthesis to cleave the benzyl- derived 
side chain protecting groups and the polypeptide- resin bond. 

More recently, the preferred "Fmoc" technique has been 
introduced as an alternative synthetic approach, offering milder 
reaction conditions, simpler activation procedures and compatibility 

30 with continuous flow techniques. This method w^s used, e.g., to 
prepare the peptide sequences disclosed in the present application. 
Here, the <x- amino group is protected by the base labile 
9-fluorenylmethoxycarbonyl {Prnoc} group. The benzyl side chain 
protecting groups are replaced by the more acid labile t -butyl 

35 derivatives. Repetitive acid treatments are replaced by deprotection 
with mild base solutions, e.g., 20% piperidine in dimethylf ormamide 
(DMF) , and the final HP cleavage treatment is eliminated. A TPA 
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solution is used instead to cleave side chain protecting groups and 
the polypeptide resin linkage simultaneously. 

At least three different polypeptide- resin linkage agents 
can be used: substituted benzyl alcohol derivatives that can be 
5 cleaved with 95% TFA to produce a polypeptide acid, methanolic 
ammonia to produce a polypeptide amide, or i% TFA to produce a 
protected polypeptide which can then be used in fragment condensation 
procedures, as described by Atherton, E. et al., J. chem. Soc. Perkin 
Trans. 1:538-546 (1981) and Sheppard, R.C. et al., Int. J. 

10 Polypeptide Prot. Res. 20:451-454 (1982). Furthermore, highly 
reactive Fmoc amino acids are available as pentaf luorophenyl esters 
or dihydro-oxobenzotriazine esters derivatives, saving the step of 
activation used in the tBoc method. 

Sequences available to use as a basis for polypeptide 

15 synthesis can be based on published sequences of G- protein coupled 
receptors, ligands and/or effectors, wherein the transmembrane or 
functional domains correspond to sections of hydrophobic or other 
amino acids of 5 to 100 amino acids, such as 5-10, 10-15, 15-25, 
20-25, 23-27, 25-30, 28-35, 20-40, 10-40, 20-30, 30-40, 40-50, 10-80, 

20 20-60 or 25-40 amino acids in length. Recombinant production of GPR 
polypeptides can be accomplished according to known method steps. 
Standard reference works setting forth the general principles of 
recombinant DNA technology include Watson, J.D. et al., Molecular 
Biology of the Gene, Volumes I and II, The Ben j amin/Cummings 

25 Publishing Company, Inc., publisher, Menlo Park, CA (1987); Darnell, 
J.E. et al., Molecular Cell Biology, Scientific American Books, Inc., 
publisher. New York, NY (1986); Lewin, B.M. , Genes JJI, John Wiley 
fc Sons, publishers, New York, NY (1989); Old, R.W. , et al., 
Principles of Gene Manipulation: An Introduction to Genetic 

30 Engineering, 2d edition. University of California Press, publisher, 
Berkeley, CA (1981); Ausubel et al, eds.. Current Protocols in *' 
Molecular Biology, Wiley Interscience, publisher, New York, NY (1987, 
1992); and Sambrook et al., Molecular Cloning: A Laboratory Manual, ' 
Second Edition, Cold Spring Harbor Laboratory, publisher, Cold 

35 Spring Harbor, NY (1989) , the entire contents of which references are 
herein incorporated by reference. 
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A nucleic acid sequence encoding a GPR polypeptide of the 
present invention may be recombined with vector DNA in accordance 
with conventional techniques, including blunt-ended or 
staggered- ended termini for ligation, restriction enzyme digestion 
5 to provide appropriate termini, filling in of cohesive ends as 
appropriate, alkaline phosphatase treatment to avoid undesirable 
joining, and ligation with appropriate ligases. Techniques for such 
manipulations are disclosed, e.g., by Ausubel et al, supra, and are 
well known in the art. 

10 A nucleic acid molecule, such as DNA, is said to be 

"capable of expressing" a polypeptide if it contains nucleotide 
sequences which contain transcriptional and translational regulatory 
information and such sequences are "operably linked" to nucleotide 
sequences which encode the polypeptide. An operable linkage is a 

15 linkage in which the regulatory DNA sequences and the DNA sequence 
sought to be expressed are connected in such a way as to permit gene 
expression as GPR polypeptides in recoverable amounts. The precise 
nature of the regulatory regions needed for gene expression may vary 
from organism to organism, as is well known in the analogous art. 

20 See, e.g., Sambrook, supra and Ausubel supra. 

The present invention accordingly encompasses the 
expression of a GPR polypeptide, in either prokaryotic or eukaryotic 
cells, although eukaryotic expression is preferred. 

Preferred hosts are bacterial or eukaryotic hosts including 

25 bacteria, yeast, insects, fungi, bird and mammalian cells either in 
vivo, or in situ, or host cells of mammalian, insect, bird or yeast 
origin. It is preferred that the mammalian celj. or tissue is of 
human, primate, hamster, rabbit, rodent, cow, pig, sheep, horse, 
goat, dog or cat origin, but any other mammalian cell may be used. 

30 Further, by use of, for example, the yeast ubiquitin 

hydrolase system, in vivo synthesis of ubiquitin- transmembrane 
polypeptide fusion proteins may be accomplished. The fusion proteins 
so produced may be processed in vivo or purified and processed in 
vitro, allowing synthesis of a GPR polypeptide ot the present 

35 invention with a specified amino terminus sequence. Moreover, 
problems associated with retention of initiation codon- derived 
methionine residues in direct yeast (or bacterial) expression may be 
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avoided. Sabin et al., Bio/Technol . 7(7): 705-709 (1989); Miller et 
al., Bio/Technol. 7(7): 698-704 (1989). 

Any of a series of yeast gene expression systems 
incorporating promoter and termination elements from the actively 
5 expressed genes coding for glycolytic enzymes produced in large 
quantities when yeast are grown in mediums rich in glucose can be 
utilized to obtain GPR polypeptides of the present invention. Known 
glycolytic genes can also provide very efficient transcriptional 
control signals. For example, the promoter and terminator signals 
10 of the phosphoglycerate kinase gene can be utilized. 

Production of GPR polypeptides or functional derivatives 
thereof in insects can be achieved, for example, by infecting the 
insect host with a baculovirus engineered to express transmembrane 
polypeptide by methods known to those of skill. See Ausubel et al, 
15 eds. Current Protocols in Molecular Biology, Wiley Interscience, 
§§16.8-16.11 (1987, 1992). 

In a preferred embodiment, the introduced nucleotide 
sequence will be incorporated into a plasmid or viral vector capable 
of autonomous replication in the recipient host. Any of a wide 
20 variety of vectors may be employed for this purpose. See, e.g., 
Ausubel et al, supra, §§ 1.5, 1.10, 7.1, 7.3, 8.1, 9.6, 9.7, 13.4, 
16.2, 16.6, and 16.8-16.11. Factors of importance in selecting a 
particular plasmid or viral vector include: the ease with which 
recipient cells that contain the vector may be recognized and 
25 selected from those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired in a particular 
host; and whether.it is desirable to be able to "shuttle" the vector 
between host cells of different species. 

Preferred prokaryotic vectors known in the art include 
3 0 plasmids such as those capable of replication in E . coli (such as, 
for example, pBR322, ColEl, pSClOl, pACYC 184, vrVX) . Such plasmids 
are, for example, disclosed by Maniatis, T., et al. {Molecular 
Cloning, A Laboratory Manual, Second Edition, cold Spring Harbor 
Press, Cold Spring Harbor, NY (1989); Ausubel et al, eds., Current 
35 Protocols in Molecular Biology, Wiley Interscience, New York, NY 
(1987, 1992)). Bacillus plasmids include pC194, pC221, pT127, etc. 
Such plasmids are disclosed by Gryczan, T. (In: The Molecular 
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Biology of the Bacilli, Academic Press, NY (1982), pp. 307-329). 
Suitable Streptomyces plasmids include pIJlOl {Kendall, K.J. , et al . , 
J. Bacteriol. 169:4177-4183 (1987)), and streptomyces bacteriophages 
auch as 0C31 (Chater, K.P., et al . , In: Sixth International 
5 Symposium on Actinomycetales Biology, Akademiai Kaido, Budapest, 
Hungary (1986), pp. 45-54). Pseudomonas plasmids are reviewed by 
John, J.F., et al. (Rev. Infect. Dis. 8:693-704 (198S) ) , and Izaki, 
K. (jpn. J. Bacteriol. 11:729-742 (1978); and Ausubel et al, supra ) . 

The expressed protein may be isolated and purified in 

10 accordance with conventional conditions, such as extraction, 
precipitation, chromatography, affinity chromatography, 
electrophoresis, or the like. For example, the cells may be 
collected by centrifugation, or with suitable buffers, lysed, and the 
protein isolated by column chromatography, for example, on 

15 DBAE- cellulose , phosphocellulose, polyribocytidylic acid-agarose, 
hydroxyapatite or by electrophoresis or immunoprecipitation. 
Alternatively, the transmembrane polypeptide or functional derivative 
thereof may be isolated by the use of anti - transmemorane polypeptide 
antibodies. Such antibodies may be obtained by well-known methods, 

20 some of which are mentioned below. These antibodies may be 
immobilized on cellulose, agarose, hollow fibers, or cellulose 
filters by covalent chemical derivatives by methods well known to 
those skilled in the art. 

As discussed herein, GPR polypeptides of the present 

25 invention may be further modified for purposes of drug design, such 
as for example to reduce immunogenicity, to prevent solubility and/or 
enhance delivery, or to prevent clearance or degradation. 

Appropriate modification of the primary amino acid sequence 
of GPR polypeptides of the present invention, obtained by mutagenesis 

30 or utilizing fragments of other related forms of G-protein 
transmembrane proteins, as described herein, will allow the creation 
of molecules which bind G-protein coupled receptors with higher 
affinity than that exhibited by naturally occurring transmembrane 
domains. Small polypeptides that are provided according to the 

35 present invention which polypeptides maintain G-protein coupled 
receptor binding inhibition activity, are expected to have two 
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advantages over larger polypeptides. These advantages include (i) 
greater stability and dif fusibility, and (2) less immunogenicity . 

Since polypeptides according to the present invention are - 
generally small (10-40, 20-30, 15-25, 30-45 amino acids), cell or 
5 tissue sources of G-protein coupled receptors are not required to . 
practice the present invention, since known polypeptide syntheses 
steps can be used without undue experimentation to provide GPR 
polypeptides or sequences substantially corresponding thereto. 
Pharmaceutic al Preparations 

1 0 Preparations of GPR polypeptides for parenteral 

administration include sterile aqueous or non- aqueous solutions, 
suspensions, and emulsions, which may contain auxiliary agents or 
excipients which are known in the art. Pharmaceutical compositions 
such as tablets and capsules can also be prepared according to 

15 routine methods. 

By the term "protection" from infection or disease as used 
herein is intended "prevention, » "suppression" or "treatment." 
"Prevention" involves administration of a GPR polypeptide, 
polypeptide derivative, or anti- idiotypic antibody prior to rhp 

20 induction of the disease. 

"Suppression" involves administration of the composition 
prior to the clinical appearand of the disease. 

"Treatment" involves administration of the protective 
composition after th e appearance of the disease. it will be 

25 understood that in human and veterinary medicine, it is not always 
possible to distinguish between "preventing" and "suppressing" since 
the ultimate inductive event or events may be unknown, latent, or the 
patient is not ascertained until well after the occurrence of the 
event or events. Therefore, it is common to use the term 

30 "prophylaxis" as distinct from "treatment" to encompass both 
"preventing" and "suppressing" as defined herein. The term 
"protection," as used herein, is meant to include "prophylaxis." 

At least one GPR polypeptide, antibody or anti -idiotypic 
antibody of the present invention may be administered by any means 

35 that achieve their intended purpose, for example, to treat GPR 
related pathologies, such as psychotic disorders, including 
schizophrenia, by inhibition of binding of Dopamine D = receptors 
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using a GPR polypeptide corresponding to a fragment or consensus 
portion of a dopamine D, transmembrane domain; in the form of a 
pharmaceutical composition. 

For example, administration of such a composition may be by 
5 various parenteral routes such as subcutaneous, intravenous, 
intradermal, intramuscular, intraperitoneal, intranasal, transdermal, 
or buccal routes. Alternatively, or concurrently, administration may 
be by the oral route. Parenteral administration can be by bolus 
injection or by gradual perfusion over time. 
10 A preferred mode of using a Gpr pharmaceutical composition 

of the present invention is by intravenous or parenteral application. 

A typical regimen for preventing, suppressing, or treating 
G-protein coupled receptor pathologies, such as dopamine receptor 
related schizophrenia, comprises administration of an effective 
15 amount of a GPR polypeptide, consensus sequence, or chemical 
derivative thereof, administered over a period of one or several 
days, up to and including between one week and about 24 months. 

It is understood that the dosage of a GPR polypeptide of 
the present invention administered in vivo or in vitro will be 
20 dependent upon the age, sex, health, and weight of the recipient, 
kind of concurrent treatment, if any, frequency of treatment, and the 
nature of the effect desired. The ranges of effective doses provided 
below are not intended to limit the inventors and represent preferred 
dose ranges. However, the most preferred dosage will be tailored to 
25 the individual subject, as is understood and determinable by one of 
skill in the art, without undue experimentation. 

The total dose required for each treatment may be 
administered by multiple doses or in a single dose. a GPR 
polypeptide or functional a chemical derivative thereof may be 
30 administered alone or in conjunction with other therapeutics directed 
to GPR related pathologies, such as a the dopamine receptor related 
pathology as a non limiting example, or directed to other symptoms 
of the disease. 

Effective amounts of the a GPR polypeptide or composition, 
35 which may also include a functional derivative thereof, or a GPR 
anti- idiotypic antibody, are from about 0.01 /ig to about 100 mg/kg 
body weight, and preferably from about 10 ^g to about 50 mg/kg body 
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weight, such 0.05, 0.07, 0.09, 0.1, 0.5, 0.7, 0.9, 1, 2, 5, 10, 20, 
25, 30, 40, 45, or 50 mg/kg. 

Preparations for parenteral administration include sterile 
aqueous or non-aqueous solutions, suspensions, and emulsions, which 
5 may contain auxiliary agents or excipients which are known in the 
art. Pharmaceutical compositions such as tablets and capsules can 
also be prepared according to routine methods. 

Pharmaceutical compositions comprising at least one GPR 
polypeptide of the present invention may 

10 include all compositions wherein the GPR polypeptide is contained in 
an amount effective to achieve its intended purpose, in addition to 
the GPR polypeptide, a pharmaceutical composition may contain 
suitable phannaceutically acceptable carriers, such as comprising 
excipients and auxiliaries which facilitate processing of the active 

15 compounds into preparations which can be used pharmaceutically . 

Pharmaceutical compositions include suitable solutions for 
administration intravenously, subcutaneously, dermally, orally, 
mucosally, rectally or may by injection or orally, and contain from 
about 0.01 to 99 percent, preferably from about 2C to 75 percent of 

20 active component (i.e. the antibody) together with the excipient. 
Pharmaceutical compositions for oral administration include tablets 
and capsules. Compositions which can be administered rectally 
include suppositories. 

Example 1: Synthesis of a G- Protein Tra nsmembrane Polypeptide and 
25 Consensus Polypeptide 

The polypeptides in Figs. 1-5 were synthesized using the 
following procedure and include the following characteristics. 

Peptide I (SEQ ID N0:1) , as shown in Fig. l, was used as a 
control for hydrophobic interaction alone as the mechanism of binding 

30 and was run in parallel with the test polypeptides described below. 
Polypeptide II (SEQ ID N0:2), as shown in Fig. 2, represents a 
membrane -spanning fragment of transmembrane segment III in the 
dopamine D 2 receptor. This particular fragment was chosen since it 
has been implicated in the 0- adrenergic receptor as having many 

35 residues which are involved in ligand binding interaction. 
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Polypeptide III (SEQ ID NO: 3), as shown in Fig. 3, represents the 
consensus polypeptide which was developed as a model for the dopamine 
IX, system and polypeptide IV (SEQ ID NO: 4) , as shown in Fig. 4, is 
a control for length dependence to show how critical the polypeptide 
5 length is in binding studies. Polypeptide V (SEQ ID NO:5), as shown 
in Fig. 5, is a consensus sequence of transmembrane domains of 
dopamine receptors D, and D 2 . 

The above polypeptides l-v (SEQ ID NOS:l-5), as shown in 
Figs. 1-5, respectively, were synthesized using solid phase synthesis 

10 on a Milligen 9600 polypeptide synthesizer using Fmoc amino acids 
(provided by Milligen/Biosearch) and PAL polystyrene resin 
(Milligen/Biosearch) . Coupling times were 1 hour and the 
polypeptides were cleaved by t r i f 1 uo r oac e t i c 
acid/phenol/H^O/thioanisole/ethanedi thiol (82.5:5:5:5:2.5) at room 

15 temperature for 2 hours. The filtrate was collected and washed with 
2 mil of trif luoroacetic acid (TFA) and 1 mL of dichlorome thane (DCM) . 
The filtrate was reduced in vacuo to 2 ml in volume and the resulting 
polypeptide was precipitated out by the addition of water. The 
polypeptides were then dissolved in 1, 1 , 1, 3 , 3,3 -hexaf luoro-2-propanol 

20 [(HFIP) Eastman]; lyophilized; and stored at -20°C until 
purification. Polypeptides I-V (SEQ ID N0S:l-5) , were purified using 
reverse-phase HPLC using a preparative Vydac C4 column (Vydac) at 
S0°C at a flow rate of 6.0 mL/min with a linear gradient of 0-100% 
B in a 60 min period at a UV detection wavelength of 275 nm. 

25 Due to the highly hydrophobic nature of these polypeptides, 

methanol was used with 0.1% (W/V) TFA and 0.5% (W/V) HFIP as solvent 
A and 2-propanol with 0.1% TFA as solvent B, in order to purify these 
polypeptides. Further purification was performed with an analytical 
C4 column (Vydac) with an isocratic gradient of 40% B at a flow rate 

30 of l ml/min. Identity of the polypeptides was confirmed by Fast-atom 
bombardment mass spectrometry and electrospray mass spectrometry and 
amino acid analysis. Stock solutions of polypeptides were made in 
HFIP and stored at -20°- 80°C. 

Circular Dichrois m (CD) . Spectra were recorded on an Aviv 

35 model 60 DS circular dichroism spectrophotometer at room temperature 
with a 1 cm by 1 mm cell. The amplitude of the CD signal was 
calibrated using 1 0.1% (w/v) solution of d ( + ) -camphorsulfonic acid 
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(Aldrich) and the wavelength of the CD signal was set using standard 
absorbance peaks of benzene vapor. Polypeptide concentrations were 
determined in a Cary 210 UV spectrophotoraer with the absorbance . 
measured at 280 nm. Helical content was estimated using CD signal 
5 intensity according to the method of Chen, et al Biochem . 12:3350- 
3359 (1974) . This calculation compares the experimental ellipticity 
at 222 nm ([9] 222) ([6]} to a theoretical [8] 222. The theoretical 
[8] 222 is empirically adjusted to account for differences in 
polypeptide length and is based on experimental CD data from a series 

10 of proteins with known crystal structures. Since both the curve 
shape and magnitude are important in analysis of a CD spectrum for 
secondary structure contributions, we also considered qualitatively 
the contributions to the spectral shapes from different secondary 
structures using reference curves for poly (L- lysine) . 

15 Fig. 6 shows a CD spectrum of the consensus polypeptide III 

(SEQ ID NO: 3) demonstrating that the polypeptide III is only 
partially helical in a solvent system in which most membrane 
polypeptides are strongly helical . 

Preparation of Small Unil amellar Vesicles . Polypeptides 

20 were incorporated into DMPC vesicles at lipid:peptide ratio of 147:1 
in the following manner: polypeptide in HFIP was mixed with 
dimyrystyroyl - phosphatidylcholine (synthetic) (DMPC) in dry 
chloroform and dried to a film with a stream of dry nitrogen at 0°C. 
This residue was then dried further overnight under a vacuum (1 x 10* 2 

25 torr) . The residue was then hydrated in 100 mM NaCl and sonicated 
for a 30-min period under nitrogen at 0°C. The suspension was 
sedimented for a 30-min at 100,000 g (4°C) to remove any residual 
titanium particles and large unilamellar vesicles. The supernatant 
was removed and sedimented once more at 159,000 g for a 45 min period 

30 at 4°C. The supernatant in the lower portion was used immediately. 4 
This basic procedure has been shown to reliably produce small 
unilamellar vesicles. 

Radioligand Binding Assays. A 0.50 mL volume of 1.00 nM 
[ 3 H] -spiperone [New England specific activity 21.4 Ci/mmol) was added 

35 to assay tubes which contained 0.5 mL lipid/peptide supernatant, 0.5 
mL Tris buffer pH 7 . 4 and 0.5 mL of cold drug for a final volume of 
2.0 mL. Nonspecific binding was defined in the presence of l uM of 
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{ + ) butaclamol or 1 uM spiperone. Appropriate controls for lipid 
vesicles containing no polypeptide were also run. Assay tubes were 
prepared in triplicate and the mixture was incubated for 1 hat 25°C. 
Incubation was terminated by filtration through filters presoaked in 
5 0.1% polyethyleneimine (w/v, Sigma) for at least 1 h prior to use. 

Filters were then washed with 6.0 mL of cold 50 mM Tris-HCl 
buffer, pH 7.40. For detection of radioactivity, filters were placed 
in 2.0 mL of scintillation fluid (Scintiverse) and incubated for 24 
h. The activity of the tritium was determined in a Beckman LS 7500 

10 liquid scintillation counter. Specif ic binding of [ 3 H] - spiperone was 
defined as the difference in binding in the presence and absence of 
unlabeled (+) butaclamol. 

Fig. 7 shows results of radioligand binding assays 
comparing polypeptide I {SEQ ID NO:l) as a control unit polypeptide 

15 III (SEQ ID NO: 3} according to the present invention. Polypeptide 
III (SEQ ID NO: 3) is shown to unexpectedly provide receptor-like 
functional binding, as demonstrated by binding to the neuroleptic 
agent, spiperone, into a stereoselective, concentration-dependent 
manner. 

20 It has also been demonstrated that as little as 0.1% of a 

GPR polypeptide according to the present invention is able to form 
a receptor-like functional binding site. Thus, a GPR polypeptide of 
the present invention is unexpectedly shown to act both as GPR 
ligands and GPR binding sites. 

25 All references cited herein, including journal articles or 

abstracts, published or corresponding U.S. oi foreign patent 
applications, issued U.S. or foreign patents, or any other 
references, are entirely incorporated by reference herein, including 
all data, tables, figures, and text presented in the cited 

30 references. Additionally, the contents of the references cited 
within the references cited herein are also entirely incorporated by 
reference. 

Reference to known method steps, conventional methods 
steps, known methods or conventional methods is not in any way an 
35 admission that any aspect, description or embodiment of the present 
invention is disclosed, taught or suggested in the relevant art. 
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The foregoing description of the specific embodiments will 
so fully reveal the general nature of the invention that others can, 
by applying knowledge within the skill of the art (including the , 
contents of the references cited herein) , readily modify and/or adapt 
5 for various applications such specific embodiments, without undue „ 
experimentation, without departing from the generic concept of the 
present invention. Therefore, such adaptations and modifications are 
intended to be comprehended within the meaning and range of 
equivalents of the disclosed embodiments, based on the teaching and 
10 guidance presented herein. It is to be understood that the 
phraseology or terminology herein is for the purprse of description 
and not of limitation, such that the terminology or phraseology of 
the present specification is to be interpreted by the skilled artisan 
in light of the teachings and guidance presented herein. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: Murphy, Randall B. 

Schuster, David I. 

<ii) TITLE OF INVENTION: POLYPEPTIDES OF G- COUPLED RECEPTOR PROTEINS, AND 
COMPOSITIONS AND METHODS THEREOF 
(iii) NUMBER OF SEQUENCES: 95 
(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: BROWDY AND NEIMARK 

(B) STREET: 419 Seventh Street, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 
<E> COUNTRY: USA 
m ZIP • 20004 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, version #1.25 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/943,236 
<B> FILING DATE: 10-SBP-1992 
(C) CLASSIFICATION: 
(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Town send, Kevin G. 
<B) REGISTRATION NUMBER: 34,033 
(C) REFERENCE /DOCKET NUMBER: MURPHY=2 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-628-5197 

(B) TELEFAX: 202-737-3528 

(C) TELEX: 248633 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 24 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 



Leu Leu Leu Ser Leu Tyr Tyr Tyr 
20 

INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



Leu Asn Leu Ser Ala lie Ser Leu Lys Lys Lys 
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INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp Tyr Ala He Phe Val Leu Tyr Ala Ser Ala Trp Leu Ser Phe Asn 
15 10 15 

Cye Pro Phe He Val Thr Leu Asn lie Lys 
20 25 

INFORMATION FOR SEQ ID NO:4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE : amino acid 

{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Lys Ala Val Val Tyr Ser Ser He Val Ser Phe Tyr Val Phe He Asp 



INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Cys Asp val Phe Val Phe Val Asp He Met Leu Cys Thr Ala Ser 
15 10 15 

He Phe Asn Leu Cys Ala He Ser Val Gly Lys 

20 25 

INFORMATION FOR SEQ ID NO:S: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Leu Val Leu Leu Leu Phe Ala Asp Phe Ser Ser Met Leu Gly Cys 
15 10 15 

Met Ala Val Leu He Gly Phe Trp Arg Leu Lys Leu Leu Arg Asn His 
20 25 30 

Val Thr Lys Val He Ala Cys Phe Cys Ala Thr Ser Phe Cys Lys Asp 
35 40 45 

Phe Pro Ser Thr He Leu Thr Leu Thr Asn Thr Ala Val Asn Gly Gly 
50 55 60 

Phe Pro Cys Tyr Leu Tyr Ala He Val He Thr Tyr Gly Ser Phe Ala 
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Cys Trp Leu Trp Thr Leu lie Cys Leu Ala lie Ser lie Tyr Met Leu 



Ser Ala Val Leu Val Gly Leu Thr Ser Arg Tyr Thr Tyr Trp He His 
1S5 170 175 



Leu His Thr Tyr Leu Ser Val Ser His Gly Phe Trp Ala Ser Val Thr 
225 230 235 240 



(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Val Tyr He Thr Val Glu Leu Ala He Ala Val Leu Ala Thr Leu Gly 
15 10 15 

Aen Val Leu Val Cys Trp Ala Val Trp Leu Asn Ser Asn Leu Asn Val 
20 25 30 

Thr Asn Tyr Phe val Val Ser Leu Ala Ala Ala Asp He Ala Val Gly 
35 40 45 

Val -He Ala He Pro Phe Ala He Thr He Set Thr Gly Phe Cys Ala 
50 55 60 

Ala Cys His Asn Cys Leu Pha Phe Ala Cys Phe Val Leu Val Leu Thr 
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Gin Ser Ser He Phe Ser Leu Leu Ala He Ala He Asp Arg Tyr lie 



Glu Asp Val Val Pro Met Asn Tyr Met Val Tyr Tyr Asn Phe Phe Ala 
165 170 175 



lie He Val Gly Leu Phe Ala Leu Cys Trp Leu Pro Leu His He He 
225 230 235 240 



Gly Thr Ser Ala Arg Ala Leu Ala Ala His Gly Ser Asp Gly Glu Gin 
305 310 315 320 

He Ser Leu Arg Leu Asn Gly His Pro Pro Gly val Trp Ala Asn Gly 
325 330 335 

Ser Ala Pro His Pro Glu Arg Arg Pro Asn Gly Tyr Thr 
340 345 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : single 
(D> TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ala Tyr lie Gly He Glu Val Leu He Ala Leu Val Ser Val Pro Gly 
15 10 15 

Trp Leu Val He Trp Ala Val Lye Val Asn Gin Ala Leu Arg Asp Ala 
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Thr Phe Cys Phe lie val Ser lie Ala Val Ala Asp Val Ala Val Glv 
35 40 45 

Ala Leu Val lie Pro Leu Ala lie Leu He Asn He Gly Pro Arg Thr 
50 55 60 

Tyr Phe His Thr Cys Leu Met Val Ala Cys Pro Val Leu He Leu Thr 

65 70 75 80 

Gin Ser Ser He He Ala Leu Leu Ala He Ala Val Asp Arg Tyr Leu 
85 90 95 

Arg Val Lye He Pro Leu Arg Tyr Lys Thr Val Val Thr Pro Arg Arg 
100 105 no 

Ala Ala Val Ala He Ala Gly Cys Trp He Leu Ser Phe Val Val Gly 
115 120 125 

Leu Thr Pro Leu Phe Gly Trp Asn Arg Leu Gly Glu Ala Gin Arg Ala 
130 135 140 



Glu Lys Val He Ser Met Glu Tyr Met Val Tyr Phe Asn Phe Phe Val 
165 170 175 

Trp Val Leu Pro Pro Leu Leu Leu Met Val Leu He Tyr Leu Glu Val 
180 185 190 

Phe Tyr Leu He Arg Arg Gin. Leu Gly Lys Lys Val Ser Ala Ser Ser 
135 200 205 

Gly Asp Pro Gin Lys Tyr Tyr Gly Lys Glu Leu Lye He Ala Lys Ser 
210 215 220 

Leu Ala Leu He Leu Phe Leu Phe Ala Leu Ser Trp Leu Pro Leu His 
225 230 235 240 

He He Asn Cys He Thr Leu Phe Cys Pro Ser Cys Arg Lys Pro Ser 
245 250 255 

He Leu Met Tyr He Ala He Phe Leu Thr His Gly Asn Ser Ala Met 
260 2S5 270 



Lye He Trp Asn Asp His Phe Arg Cys Gin Pro Thr Pro Pro Val Asp 
290 295 300 



(2) INFORMATION FOR SEQ ID N0:9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
til) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Val Ala Phe He Gly He Thr Thr Gly Leu Leu Ser He Ala Thr Val 
1 5 10 15 

Thr Gly Asn Leu Leu Val Leu He Ser Phe Lys Val Asn Thr Glu Leu 
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- 60 - 

20 25 30 

Lys Thr Val Asn Asn Tyr Phe Leu Leu Ser He Ala Cys Ala Asp Leu 
35 40 45 

He He Gly Thr Phe Ser Met Leu Tyr Leu Leu Met His Trp Ala Leu 
50 55 SO 

Gly Thr Leu Ala Cys Asp Leu Trp Leu Ala Leu Asp Tyr Val Ala Ser 
65 70 75 80 

Asn Ala Ser Val Leu Asn Leu Leu Leu He Ser Phe Asp Arg Tyr Phe 
85 90 95 



Lys Arg Arg Trp Arg Lys He Pro Lys Arg Pro Gly Ser Val His Arg 
325 330 335 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(E) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
Ui) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Val Phe He Val Leu Val Ala Gly Ser Leu Ser Leu Val Thr He 
5 1 S io is 

lie Gly Asn He Leu Val Met Val Ser He Lys Val Asn Arg His Tyr 
20 25 30 

Phe Leu Phe Ser He Ala Cys Ala Asp Leu He He Gly Val Phe Ser 
35 40 45 

10 Met Asn Leu Tyr Thr Leu Tyr Thr Val lie Gly Tyr Trp Pro Leu Gly 

50 55 60 

Pro Val val Cys Asp Leu Tyr Val Val Sar Asn Ala Ser Val Met Asn 
62 70 75 80 

Leu Leu He He Ser Phe Asp Arg Tyr Phe Cys Val Thr Lys Pro Leu 
15 85 go 95 

Thr Tyr Pro Val Lys Arg Thr Thr Lys Met Ala Gly Met Met He Ala 
100 105 110 

Ala Ala Trp Val Leu Ser Phe He Leu Trp Ala Pro Ala He Leu Phe 
115 120 125 

20 Trp Gin Phe He Val Gly Val Arg Thr Val Glu Asp Gly Glu Cys Tyr 

130 135 140 

He Gin Phe Phe Ser Asn Pro Ala Val Thr Phe Gly Thr Ala He Ala 
145 150 155 ISO 

Ala Phe Tyr Leu Pro Val lie He Met He Val Leu Tyr Trp His He 
25 165 170 175 

Ser Arg Ala Ser Lys Ser Arg He Lys Lys Asp Lys Lys Glu Pro Val 
180 185 190 

Ala Asn Gin Asp Pro Val Ser Pro Ser Leu Val Gin Gly Arg He Val 
195 200 205 

30 Lys Pro Leu Ser Ser Asp Asp Lys He Val Arg Arg Thr Lys Gin Pro 

210 215 220 

Ala Lys Lys Lys Pro Pro Pro Ser Arg Glu Lys Lys Val Thr Arg Thr 
225 230 235 240 

, _ Ile Ala He Leu Leu Ala Phe He He Thr Trp Ala Pro Tyr Asn Val 

J5 245 250 255 

Met Val Leu He Asn Thr Phe Cys Ala Pro Cys Ile Pro Asn Thr Val 
260 265 270 

Trp Arg He Gly Tyr Trp Leu Cys Tyr He Asn Ser Thr He Asn Pro 
275 280 285 

40 Ala Cys Tyr Ala Leu Cys Asn Ala Thr Phe Lys Lys Thr Phe Lys His 

290 295 300 

Leu Ile Met Cys His Tyr Lys Asn Ile Gly Ala Thr Arg 
305 310 315 

(2) INFORMATION POP. SEQ ID NO: 11: 
45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 
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(B> TYPE: amino acid 
fC> STRANDEDNESS : Single 
<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Trp Phe He Ala Phe Leu Thr Gly lie Leu Ala Leu Val Thr He He 

15 10 15 

Gly Asn He Leu Val He Val Ser Phe Lys val Asn Lys Gin Leu Lya 
20 25 30 

Thr Val Asn Asn Tyr Phe Leu Leu Ser Leu Ala Cys Ala Asp Leu He 
35 40 45 

He Gly Val He Ser Met Asn Leu Phe Thr Thr Tyr He He Met Asn 
50 55 60 

Arg Trp Ala Leu Gly Asn Thr Ala Cys Asp Leu Trp He Ala He Asp 
65 70 75 80 

Tyr Val Ala Ser Asn Ala Ser Val Leu Asn Leu Leu Val He Ser Phe 



Abh Pro Val Cys Tyr Ala Leu Cys Asn Lye Thr Phe Arg Thr Thr Phe 
305 310 315 320 

Lys Thr Leu Leu Leu Cys Gin Cys Asp Lys Arg Lys Arg Arg Lys Gin 
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Gin Tyr Gin Gin Arg Gin Ser Val lie Phe His Lys Arg Val Pro Glu 
340 345 350 



(2) INFORMATION FOR SEQ ID NO: 12; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STEANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Phe He Ala Thr Val Arg Gly Ser Leu Ser Leu Val Thr Val 
1 5 10 15 

Val Gly Asn lie Leu Val Met Leu Ser He Lys Val Asn Arg Gin Leu 
20 25 30 

Gin Thr Val Asn Asn Tyr Phe Leu Phe Ser He Ala Cys Ala Asp Leu 
35 40 45 

He He Gly Ala Phe Ser Met Asn Leu Tyr Thr Val Tyr He lie Lys 
50 55 SO 

Gly Tyr Trp Pro Lau Gly Ala Trp Cys Asp Leu Trp Leu Ala Leu Aap 
65 70 75 80 

Tyr Val Val Ser Asn Ala Ser Val Met Leu Leu He He Ser Phe Asp 
85 90 95 

Arg Tyr Phe Cys Val Thr Lys Pro Leu Thr Tyr Pro Ala Arg Arg Thr 
100 105 11 

Thr Lys Mat Ala Gly He Met He Ala Ala Ala Trp Val Leu Ser Phe 
H5 120 125 

Val Leu Trp Ala Pro Ala He Leu Phe Trp Gin Phe Val Val Gly Lys 
130 135 140 

Arg Thr Val Pro Asp Asn Gin Cys Phe He Gin Phe Leu Ser Asn Pro 
145 150 1S5 160 

Ala Val Thr Phe Gly Thr Ala He Ala Ala Phe Tyr Leu Pro Val Val 
lfi5 170 175 

He Met He Val Leu Tyr He His He Ser Leu Ala S B r Arg Ser Arg 
180 185 190 

Val His Lys His Arg Pro Glu Gly Pro Lys Glu Lys Lys Ala Lys Thr 
195 200 205 

He Ala Phe Leu iys Ser Pro He Met Gin Ser Val Lye Lys Pro Pro 
210 215 220 

Pro Gly Glu Ala Lys Phe Ala Ser He Ala Arg Asn Gin Val Arg Lys 
225 230 235 240 

Lys Arg Gin Leu Ala Ala Arg Glu Arg Lys Val Thr Arg Thr He Phe 
245 250 255 

Ala He Leu Leu Ala Phe He Leu Thr Trp Thr Pro Tyr Asa Val Met 
2S0 265 270 
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Val Leu Val Asn Thr Phe Cys Gin Ser Cys lie Pro Asp Thr Val Trp 
275 280 285 

Ser lie Gly Tyr Trp Leu lie Cys Tyr Val Asn Ser Thr He Asn Pro 
290 295 300 

Ala Cys Tyr Ala Leu Cys Asn Ala Thr Phe Lys Lys Thr Phe Arg His 
305 310 315 320 

Leu Leu Leu Cys Qln Arg Tyr Asn He Gly Thr Ala Arg 
325 330 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 348 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Val He Thr He Ala Val Val Thr Ala Val Val Ser Leu Met Thr He 
1 5 10 15 

Val Gly Asn Val Leu Val Met He Ser Phe Lys Val Asn Ser Gin Leu 
20 25 30 

Lys Thr Val Asn Asn Tyr Tyr Leu Leu Ser He Ala Cys Ala Asp Leu 
35 40 45 

He He Gly He Phe Ser Met Asn Leu Tyr Thr Thr Tyr He Leu He 
50 55 60 

Met Gly Arg Trp Ala Leu Gly Ser Leu Ala Cys Asp Leu Trp Leu Ala 
65 70 75 80 

He Asp Tyr Val Ala Ser Asn Ala Ser Val Leu Asn Leu Leu Val He 
85 90 95 



Val Gly Lys Arg Thr Val Pro He Asp Glu Cys Gin He Gin Phe Leu 
145 150 155 160 

Ser Glu Pro Thr He Thr Phe Gly Thr Ala He Ala Ala Phe Tyr He 
165 170 175 

Pro Val Ser He Met Arg He Lsu Tyr Cys Arg He Tyr Arg 31u Thr 
160 185 190 



Cys Leu Arg Cys Pro Arg Pro Thr Lys Gly Leu Asn Pro Asn Pro Ser 
225 230 235 240 

His Gin Met Thr lys Arg Lys Arg Met Ser Leu Val Lys Glu Arg Lys 
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- 65 - 
250 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 amino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS i single 
(D) TOPOLOGY: linear 
<ii) MOLE COLE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Thr Ala Gly Asp Cys Leu He Met Leu He Val Leu Leu He Val Ala 
15 10 15 

Gly Asn Val Leu Val He Val Ala He Ala Lys Thr Pro Arg Leu Gin 
20 25 30 

Thr Leu Thr Asn Leu Phe He Met Ser He Ala Ser Ala Asp Leu val 
35 40 45 

Met Leu Leu Leu Val Val Pro Phe Cys Ala Thr Leu Val Val Trp Gly 
SO 55 eo 

Arg Trp Glu Tyr Gly Ser Phe Phe Cys Glu Leu Trp Thr Ser Val Asp 
65 70 75 80 

Val Leu Cys Val Thr Ala Ser He Glu Thr Leu Cys Val He Ala Leu 
85 30 95 
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Ser Pro Ser Pro Ser Pro Val Pro Ala Pro Ala Pro Pro Gly Pro Pro 
210 215 220 

Arg Pro Ala Ala Ala Ala Ala Thr Ala Pro Leu Ala Asn Glv Arg Ala 
225 230 235 240 

Gly Lys Arg Arg Pro Ser Arg Leu Val Ala Leu Arg Glu Gin Lys Ala 
245 250 255 

Leu Lys Thr Leu Gly lie He Met Gly Val Phe Thr Leu Cys Trp Leu 
2G0 265 270 

Pro Phe Phe His Arg Glu Leu Val Pro Asp Arg Leu Phe Val Phe Phe 
275 280 295 

Asn Trp Leu Arg Tyr Ala Asn Ser Ala Phe Aan Pro He He Tyr Cye 
290 295 300 

Arg Ser Pro Asp Phe Arg Lys Ala Phe Gin Gly Leu Leu Cys Cye Ala 
3 °5 310 315 320 

Arg Arg Ala Ala Arg Arg Arg Hia Ala Thr His Gly Asp Arg Pro Arq 
325 330 335 

Ala Ser Gly Cys He Ala Arg Pro Gly Pro Pro Ser Pro Gly Ala Ala 
340 345 350 

Ser Asp Asp Asp Asp Asp Asp Val Val Gly Ala Thr Pro Pro Ala Arg 
355 360 365 



(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Val Val Gly He Val Met Ser Leu He Val Leu Ala He Val Phe Glv 
1 5 io 1S 

Asn Val Leu Val He Thr Ala He Ala Lys Phe Glu Arg Leu Gin Thr 
20 25 30 

Val Thr Asn Tyr Phe He Thr Ser He Ala Cys Ala Asp Leu Val Met 
35 40 45 

Gly Leu Ala Val Val Pro Phe Gly Ala Ala His He Leu Met Lys Met 
50 55 SO 

Trp Thr Phe Gly Asn Phe Trp Cys Glu Phe Trp Thr Ser He Asp Val 
65 70 75 80 

Leu Cys Val Thr Ala Ser He Glu Thr Leu Cys Val He Ale. Val Asp 
85 90 95 

Arg Tyr Phe Ala He Thr Ser Pro Phe Lys Tyr Gin Ser Leu Leu Thr 
100 105 no 

Lys Asn Lys Ala Arg Val He He He Met Val Trp lie Val Ser Gly 
115 120 125 

Leu Thr Ser Phe Leu Pro He Leu Tyr Arg Ala Thr Hia Gin Glu Ala 
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lie Asn Cya Tyr Ala Asn Glu Thr Cys Cys Asp Phe Phe Thr Asn Gin 

145 150 155 160 

Ala Tyr Ala Ala Ser Ser Ala Val Ser Phe Tyr Val Pro Leu Val lie 

1S5 170 175 



Gin Lys He Asp Lys Ser Glu Gly Arg Phe He Phe Val Gin Asn Leu 
195 200 205 



Ser Lys Phe Cys Leu Lys Glu His Lys Ala Leu Lys Thr Leu Gly He 
225 230 235 240 

He Pro Cys Thr Phe Thr Leu Cys Trp Leu Pro Phe Phe He Val Asn 
245 250 255 

He Val Val He Gin Asp Asn Leu He Arg Lys Glu Val Tyr He Leu 
260 265 270 



Asn Thr Gly Glu Gin Ser Gly Tyr His Val Glu Gin Glu Lys Glu 1 
325 330 335 



30 (2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRRMDEDNESS : single 
35 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Ala Ala Leu Ala Gly Ala Leu Leu Ala Leu Ala Val Leu Ala Thr Val 
15 10 15 

40 Gly Gly Asn Leu Leu Val He Val Ala He Ala Trp Thr Pro Arg Leu 

20 25 30 

Gin Thr Met Thr Asn Val Phe Val Thr Ser Leu Ala Ala Ala Asp Leu 
35 40 45 

Asp Leu Leu Val Val Pro Pro Ala Ala Thr Leu Ala Leu Thr Gly His 
45 50 55 60 

Trp Pro Leu Gly Ala Thr Gly Cys Glu Leu Trp Thr Ser Val Asp Val 
65 70 75 80 
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Leu Cys Val Thr Ala Ser He Glu Thr Leu Cys Ala He Ala val Asp 



He Tyr Cys Arg Ser Pro Asp Phe Arg Ser Ala Phe Arg Arg Leu Leu 
305 310 315 320 



Pro Ala Leu Phe Pro Ser Gly Val Pro Ala Ala Glu Ser Ser Pro Ala 
340 345 350 



(2) INFORMATION FOR SEQ ID HO: 17: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 
(it) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala He Leu Leu Gly Val He Leu Gly Gly Leu He Leu Phe Gly Val 

15 10 15 

Leu Gly Asn He Leu Val He Leu Ser Val Ala Cys His Arg His Leu 
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69 



20 



25 



His Ser Val Thr His Tyr Tyr lie Val Asn 
3S 40 

Leu Leu Thr Ser Thr Val Leu Pro Phe Ser 
50 55 

Gly Tyr Trp Lys Phe Gly Arg Val Phe Cys 
65 70 

Asp Val Leu Cys Cys Thr Ala Ser He Met 



85 



90 



30 

Leu Ala Val Ala Asp Leu 

45 

Ala He Phe Glu He Leu 
60 

Asn Val Trp Ala Ala Val 
75 80 

Leu Leu Cys He He Ser 
95 



Leu Phe Ser Ala Leu Gly Ser Phe Tyr Val 1 
165 170 



i Lye Arg Glu Ser Arg Gly 



Ala Pro Ser His Val Leu Glu Gly Gin His I 
340 345 
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(2) INFORMATION FOR SEQ ID NO: 18: 
<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
Ui) MOLE COLE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:18: 

Ala He Ser Val Gly Leu Val Leu Gly Ala Phe He Leu Phe Ala He 
15 10 15 

Val Gly Asn He Leu Val He Leu Ser Val Ala Cys Asn Arg His Leu 
20 25 30 

Arg Thr Pro Thr Asn Tyr Phe He Val Asn He Ala He Ala Asp Leu 
35 40 45 

Leu Leu Ser Phe Thr Val Leu Pro Phe Ser Ala Thr Leu Glu Val Leu 
50 55 60 

Gly Tyr Trp Val Leu Gly Arg He Phe Cys Asp He Trp Ala Ala Val 
65 70 75 80 

Asp Val Leu Cys Cys Thr Ala Ser He Leu Ser Leu Cys Ala He Sar 
85 90 95 

He Asp Arg Tyr He Gly Val Arg Tyr Ser Leu Gin Tyr Pro Thr Leu 
100 105 110 



Pro Abb Asp Asp Lys Glu Cys Val Thr Glu Glu Pro Phe Leu Phe Cys 
145 150 155 ISO 



Lys Gly His Asn Pro Arg Ser Ser He Ala Val Lys Leu Phe Lye Phe 

225 230 235 240 

Ser Arg Glu Lys Lys Ala Ala Lys Thr Leu Gly Ha Val Val Gly Trp 
245 250 255 

He Leu Cys Trp Leu Pro Phe Phe He Ala Leu Pro Leu Gly Ser Leu 
260 265 270 



Lys Glu Phe Lys Arg Ala Leu Leu Gly Cys Gin Cys Arg Gly Gly Arg 
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- 71 - 

305 310 315 320 

Arg Arg Arg Arg Arg Arg Arg Leu Ala Cys Ala Tyr Thr Tyr Arg Pro 
325 330 335 



10 (2) INFORMATION FOR SEQ ID HO: 19: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



20 


Val 

1 

Val 
Arg 


Ala 
Gly 
Ala 


Gly 

Asn 

Pro 
35 


Leu 
val 

20 

Gin 


Ala 
5 

Leu 
Asn 


Ala 
val 
Leu 


Val 
Val 
Phe 


Val 

He 

Leu 
40 


25 


Leu 


Val 
50 


Ala 


Thr 


Leu 


Val 


Met 

55 


Pro 




Tyr 

es 


Trp 


Tyr 


Phe 


Gly 


Gin 
70 


val 


Trp 




val 


Leu 


Phe 


Cys 


Thr 
85 


Ser 


Ser 


lie 


30 


Asp 


Arg 


Tyr 


Trp 
100 


Ser 


Val 


Thr 


Gin 




Thr 


Pro 


Arg 
115 


Arg 


Val 


Lys 


Ala 


Thr 
120 


35 


Ala 


Val 
130 


lie 


Ser 


Phe 


Pro 


Pro 
135 


Leu 




Gly 
145 


Ala 


Ala 


Tyr 


Pro 


Gin 

150 


Cya 


Gly 




Leu 


Ser 


Ser 


Cys 


He 

165 


Gly 


Ser 


Phe 


40 


Leu 


Val 


Tyr 


Ala 
180 


Arg 


He 


Tyr 


Arg 




Leu 


Ser 


Qlu 
195 


Lys 


Arg 


Ala 


Pro 


Val 
200 


45 


Thr 


Glu 
210 




Gly 


Leu 


Gly 


Ala 
215 


Ala 




Ala 

225 


Arg 


Phe 


Leu 


Ser 


Arg 

230 


Arg 






Arg 


Lys 


Val 


Ala 


Gin 


Ala 


Arg 


Glu 



WO 94/05695 



PCT/US93/08528 







245 






250 




255 




Leu Val Phe 


Val Leu 
260 


Cys 


Trp 


Phe Pro Phe Phe 


Phe 


He Tyr Ser Leu 

270 


5 


Tyr Gly He 
275 


Cys Arg 


Slu 


Ala 


Cys Gin val Pro 
280 


Gly Pro Leu Phe Lys 
285 




Phe Phe Phe 
290 


Trp He 


Gly 


Tyr 
295 


Cys Asn Ser Ser 


Leu 
300 


Asn Pro Val He 




Tyr Thr Val 
305 


Phe Aan 


Gin 
310 


Afip 


Phe Arg Pro Ser 
315 


Phe 


Lys His He Leu 
32D 


10 


Pha Arg Arg 


Arg Arg 
325 


Arg 


Gly 


Phe Arg Gin 
330 







(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 
<B) TYPE: amino acid 
[C> STHANDEDNESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Thr Ala Ala He Ala Ala Ala He Thr Phe Leu He Leu Phe Thr He 
1 S 10 is 

Phe Gly Asn Ala Leu Val He He Ala Val Leu Thr Ser Arg Ser Leu 
20 25 30 

Arg Ala Pro Gin Asn Leu Phe Leu Val Ser He Ala Ala Ala Asp He 
35 40 45 

Leu Val Ala Thr Leu He He Pro Phe Ser Leu Ala Asn Glu Leu Leu 
50 55 60 

Gly Tyr Trp Tyr Phe Arg Arg Thr Trp Cys Glu Val Tyr Leu Ala Leu 
55 70 75 ' go 

Asp Val Leu Phe Cys Thr Ser Ser He Val His Leu Cys Ala He Ser 
85 90 95 

Leu Asp Arg Tyr Trp Ala Val Ser Arg Ala Leu Glu Tyr Asn Ser Lys 
100 105 no 

Arg Thr Pro Arg Arg He Lys Cys He He Leu Thr Val Trp Leu He 
115 120 125 

Ala Ala Val He Ser Leu Pro Pro Leu He Tyr Lys Gly Asp Gin Gly 
!30 135 140 

Pro Gin Pro Arg Gly Arg Pro Gin Cys Lys Leu Asn Gin Glu Ala Trp 
145 150 155 iso 

Tyr He Leu Ser Ser He Gly Ser Phe Phe Ala Pro Cys Leu He Leu 
165 170 175 

Leu Val Tyr Leu Arg He Tyr Leu He Ala Lys Arg Ser Asn Arq Aro 
ISO 185 iso 

Gly Pro Arg Ala Lys Cys Gly Pro Gly Gin Gly Glu Ser Lys Gin Pro 
195 200 205 

Arg Pro Asp His Gly Gly Ala He Ala Ser Ala Lys Leu Pro Ala He 
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Ala Ser Gly Arg Gly Val Gly Ala lie Gly Gly Gin Trp Trp Arg Arg 
225 230 235 240 

Arg Ala His Val TJir Arg Glu Lys Arg Phe Thr Phe Val Leu Ala Val 
245 250 255 

Val lie Gly Val Phe Val Leu Gys Trp Phe Pro Phe Phe Phe Ser Tyr 
260 265 270 

Ser Leu Gly Ala lie Cys Pro Lys His Cys Lya Val Pro His Gly Leu 
275 280 285 

Phe Gin Phe Phe Phe Trp lie Gly Tyr Cys Asn Ser Ser Leu Asn Pro 
290 255 300 

Val lie Tyr Thr lie Phe Asn Gin Asp Phe Arg Met Phe Arg Arg lie 
3 °5 310 315 320 

Leu Cys Arg Pro Trp Thr Gin Thr Ala Trp 
325 330 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDSESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Thr Leu Thr Leu val Cys He Ala Cys Leu Ser Leu Thr Val Phe Gly 
1 5 10 15 

Asn Val Leu Val He He Ala Val Phe Thr Ser Arg Ala Leu Lys Ala 
20 25 30 

Pro Gin Asn Leu Phe Leu Val Ser He Ala Ser Ala Asp He Leu Val 
35 40 45 

Ala Thr Leu Val He Pro Phe Ser Leu Ala Asn Glu Val Asn Gly Tyr 
50 55 60 

Trp Tyr Phe Gly Lys Trp Cys Glu He Tyr Leu Ala Leu Asp Val Leu 
65 70 75 80 

Phe Cys Thr Ser Ser He Val His Leu Cys Ala He Ser Leu Asp Arq 
85 so 95 

Tyr Trp Ser He Thr Gin Ala He Glu Tyr Asn Leu Lys Arg Thr Pro 
100 105 no 

Arg Arg He Lys Ala He He He Thr Val Trp Val He Ser Ala Val 
115 120 125 

He Ser Phe Pro Pro Leu He Ser He Glu Lys Lys Gly Gly Gly Gly 
130 135 140 



Tyr val He Ser Ser Cys He Gly Ser Phe Phe Ala Pro Cys Leu He 
165 170 175 

Trp Leu Val Tyr Val Arg He Tyr Gin He Ala Lys Arg Arg Thr Arg 
180 185 190 
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Val Pro Pro Ser Arg Arg Asp Pro Asp Ala Val Ala Ala Pro Pro Gly 
195 200 205 

Gly Thr Glu Arg Arg Pro Asn Gly Leu Gly Pro Glu Arg Ser Ala Glv 
210 215 220 

Pro Gly Gly Gly Arg Gly Arg Ser Ala Ser Gly Leu Pro Arg Arg Arg 
225 230 235 240 

Ala Gly Ala Gly Gly Gin Asn Arg Glu Lye Arg Phe Thr Phe Val lie 
245 250 255 

Ala Val Val lie Gly Val Phe Val Val Cya Trp Phe Pro Phe Phe Phe 
260 265 270 

Thr Tyr Thr Leu Thr Ala Val Leu Cya Ser Val Pro Arg Thr Leu Phe 
275 280 285 

Lys Phe Phe Phe Trp Phe Gly Tyr Cys Am Ser Ser Leu Asn Pro Val 
290 295 300 

lie Tyr Thr He Phe Asn His Asp Phe Arg Arg Ala Phe Lys Lys He 
305 310 315 32Q 

Leu Cys Arg Gly Asp Arg Lys Arg He Val 
325 330 

(2) INFORMATION FOR SEQ ID 110:22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: i34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Thr Leu Thr Leu Val Cys He Ala Gly Leu He Met Leu Ph<= Thr Val 
1 5 10 is 

Phe Gly Asn Val Leu Val He He Ala Val Phe Thr Ser Arg Ala Leu 
20 25 30 

Lys Ala Pro Gin Asn Leu Phe Leu Val Ser He Ala Ser Ala Asp He 
35 40 45 

Leu Val Ala Thr Leu Val He Pro Phe Ser Leu Ala Asn Glu Val Met 
50 55 go 

Tyr Trp Tyr Phe Gly Lys Val Trp Cys Glu He Tyr Leu Ala He Asp 
65 70 75 80 

Val Leu Phe Cys Thr Ser Ser He Val His Leu Cys Ala He Ser Leu 
85 go g5 

Asp Arg Tyr Trp Ser He Thr Gin Ala He Glu Tyr Asn Leu Ly B Arg 



Ala Val He Ser Phe Pro Pro Leu Leu He Ser He Glu Lys Lys Glv 
130 135 140 

Ala Gly Gly Gly Gin Gin Pro Ala Glu Pro Ser Cys Lys lie Aan Asp 
145 150 155 160 

Gin Lys Trp Tyr Val He Ser Ser Ser He Gly Ser Phe Phe Ala Pro 
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(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) length: 321 amino acids 

(B) TYPE: amino acid 

<C) Sthahdedhess : single 
(D) TOPOLOGY: linear 
(ii> MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Leu Leu Thr Ala Leu Val Leu Ser Val He lie Val Leu Thr lie lie 
15 10 15 

Gly Asn lie Leu Val lie Leu Ser Val Phe Thr Tyr Lys Pro Leu Arg 
20 25 30 

He val Gin Asn Phe Phe lie Val Ser lie Ala Val Ala Asp Leu Thr 
35 40 45 

Val Ala Leu Leu Val Leu Pro Phe Trp Ala Tyr Ser He Leu Gly Arg 
50 55 60 

Trp Glu Phe Gly He His Leu Cys Lys Leu Txp Leu Thr Cye Asp Val 
65 70 75 80 

Leu Cys Cye Thr Ser Ser He Leu Asn Leu CyB Ala He Ala Leu ABp 
85 90 95 



Leu He Ser Ser sro Pro Leu He Gly Trp Asn Asp Trp Pro Asp Glu 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Arg lie Leu Thr Ala Cys Phe Leu Ser Leu Leu lie Leu Ser Thr Leu 
15 10 15 

Leu Gly Asn Thr Leu Val Cys Ala Ala Vol lie Arg Phe Arg His Leu 
20 25 30 

Arg Ser Lys Val Thr Abu Phe Phe Val He Ser Leu Ala Val Ser Asp 
35 40 45 

Leu Leu Val Ala Val Leu Leu Trp Lye Ala Val Ala Glu lie Ala Gly 
50 55 60 

Phe Trp Pro Phe Gly Ser Phe Cys Asn He Trp Val Ala Phe Asp He 
65 70 75 80 

Met Cys Ser Thr Ala Ser He Leu Asn Leu Cys Val He Ser Val Asp 
85 90 95 

Arg Tyr Trp Ala He Ser Ser Pro Phe Arg Tyr Glu Arg Lys Lys Arg 
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(2) INFORMATION FOR SEQ ID HO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 



Gly Asn Val Leu Val Cys Ala Ala He Val Arg Ser Arg His Leu Leu 
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- 78 - 

Val Phe lie Val Ser He Ala Val Ser Asp Leu Phe Val Ala Leu Leu 
35 40 45 

Val Asn Thr Trp Lys Ala Tyr Ala Glu Val Ala Gly Tyr Trp Pro Phe 
50 55 SO 

Gly Ala Phe Cys Asp Val Trp Val Ala Phe Asp He Met Cys Ser Thr 

65 70 75 80 

Ala Ser He Leu Asn Leu Cys Val He Ser Val Asp Arg Tyr Trp Ala 
85 90 95 



His Phe Cys Ser Arg Thr Pro Val Glu Thr Val Asn He Ser Asn Glu 
305 310 315 320 



(2) INFORMATION FOR SEQ ID N0:2S: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
Sii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Tyr Asn Tyr Tyr Ala Thr Leu Leu Thr Leu Leu He Ala Vai He Val 
15 10 15 

Phe Gly Asn Val Leu Val Cys Met Ala Val Ser Arg GLu Lys Ala Leu 
20 25 30 

Gin Thr Met Asn Tyr Leu He Val Ser He Ala Val Ala Asp Leu Leu 
35 40 45 

Val Ala Thr Leu Val Trp Trp Trp Tyr Leu Glu Val Val Gly Glu Trp 
50 55 60 

Lys Phe Ser Arg He His cys Asp He Phe Val Thr Leu Asp He Thr 
65 70 75 80 

Ala Ser He Leu Asn Leu Cys Ala He Ser He Asp Arg Tyr Thr Ala 
85 90 95 



45 (2) INFORMATION FOR SEQ ID NO: 27: 
<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID KO:27: 

Ala Tyr Tyr Ala Leu Ser Tyr Cys Ala Leu He Leu Ala He Val Phe 
15 10 15 

Gly Asn Gly Leu Val Cys Met Ala val Leu Arg Glu Lys Ala Leu Gin 
20 25 30 

Thr Thr Thr Asn Tyr Leu Val Val Ser Leu Ala Val Ala Asp Leu Leu 
35 40 45 

Val Ala Thr Leu val Trp Trp Val val Tyr Leu Glu Val Thr Gly Gly 
50 55 GO 

Val Trp Asn Phe Ser Arg He Cys Cys Asp Val Phe val Thr Leu Asp 
65 70 75 80 

Val Met Met Thr Ala Ser He Leu Asn Leu Cys Ala He Ser He Aep 
85 90 95 



Leu Pro ABp Pro Ala His Leu Glu Leu LyG Arg Ser Asn Gly Arg Leu 
210 215 220 
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12) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



10 15 

Ala Gly Asn Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala Leu 
20 25 30 

Gin Thr Pro Thr Asn Ser Phe lie Val Ser Leu Ala Ala Ala Asp Leu 
35 40 45 

Leu Leu Ala Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val Gin 
50 55 60 

Gly Ala Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Val Met Leu Cys 

65 70 75 BO 

Thr Ala Ser lie Phe Asn Leu Cys Ala He Ser val Asp Ar- Phe Val 
85 90 95 



Asn Val Phe Arg Lys Ala Leu Arg Ala Cys Cys 
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- 82 - 

305 310 315 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 327 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Lys lie Ser Leu Ala Val Val Leu Ser Val lie Thr Leu Ala Thr Val 
15 10 15 

Leu Ser Asn Ala Phe Val Leu Thr Arg lie Leu Leu Thr Arg Lys Leu 
20 25 30 

His Thr Pro Ala Asn Tyr Leu lie Gly Ser He Ala Thr Thr Asp Leu 
35 40 45 

Leu Val Ser He Leu Val Trp He Ser He Ala Tyr Thr He Thr His 
50 55 60 

Thr Trp Asn Phe Gly Gin He Leu Cys Asp He Trp Leu Ssr Ser Asp 
65 70 75 80 

He Thr Cys Cys Thr Ala Ser He Leu His Leu Cys Val He Ala Leu 
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(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32S amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



Gly Asn Ala Cys Val Val Ala Ala He Ala Leu Glu Arg Ser Leu Gin 
20 25 30 

Asn Val Ala Asn Tyr Leu lie Gly Ser Leu Ala val Arg Asp Leu Met 
35 40 45 

Val Ser Val Leu Val Leu Pro Met Ala Ala Leu Tyr Gin Val Leu Asn 
50 55 SO 

Lys Trp Thr Leu Gly Gin Val Thr Cys Asp Leu Phe He Ala Leu Asp 
65 70 75 80 

Val Leu Cys Cys Thr Ser Ser He Leu His Leu Cys Ala He Ala Leu 



Thr Pro Arg Pro Arg Ala Leu He Ser Leu Thr Trp Leu He Gly Phe 
115 120 125 



Ser Asp Pro Asp Ala Cys Thr He Ser Lys Asp His Gly Tyr Thr He 
145 150 155 160 



Leu Tyr Gly Arg He Phe Arg Ala Ala Arg Phe Arg He Arg Lye Thr 
180 185 190 



Pro Ala Pro Gin Pro Lys Lys Ser Val Asn Gly Glu Ser Gly Ser Arg 
210 215 220 



Phe He Leu Cys Trp Leu Pro Phe Phe He Val Ala Leu Val Leu Pro 

260 2G5 270 

Phe Cys Glu Ser Ser Cys His Met Pro Thr Leu He Arg Ala lie He 



WO 94/05695 



PCT/US93/08528 



(2) INFORMATION FOR SEQ ID NO; 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 385 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gin Aan Trp Pro Ala Leu Ser He Val Val He He He Asn Thr He 
15 10 15 

Gly Gly Asn He Leu Val He Met Ala Val Ser Lys Lys Leu His Asn 
20 25 30 

Ala Thr Asn Tyr Phe Leu Met Ser He Ala tie Ala Asp Me» Leu Val 
35 40 45 

Gly Phe Leu Val Trp Leu Ser Leu Leu Ala He Leu Tyr Asp Tyr Val 
50 55 SO 

Trp Pro Leu Pro Arg Tyr Leu Cys Pro Val Trp He Ser Leu Asp Val 
65 70 75 80 

Leu Phe Ser Thr Ala Ser He Met His Leu Cys Ala He Ser Leu Asp 
85 90 55 
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(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii> MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Lys Aen Trp Ser Ala Leu Leu Thr Thr Val Val lie lie Leu Thr He 
15 10 15 

Ala Gly Asn He Leu Val He Met Ala Val Ser Leu Glu Lys Lys Leu 
20 25 30 

Gin Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala He Ala Asp Met 
35 40 45 

Leu Leu Gly Phe Leu Val Trp Val Ser Asn Glu Thr He Leu Tyr Gly 
50 55 60 

Tyr Arg Trp Pro Leu Pro Ser Lys Leu Cys Ala He Trp He Tyr Leu 
S5 70 75 80 

Asp Val Leu Phe Ser Thr Ala Ser He Met His Leu Cys Ala He Ser 
85 90 95 
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(2> INFORMATION FOR SEQ ID NO: 33: 
(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 

(C) STSANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



15 10 15 

Gly Aan Val Val Val Cys He Ala Val Gly lie Asn Arg Arg Leu Arg 
20 25 30 

Asn Leu Thr Asn Cys Phe He val Ser Leu Ala He Thr Asp Leu Leu 
35 40 45 

Leu Gly Leu Leu Val Leu Pro Phe Ser Ala He Tyr Gin Leu Ser Cys 
50 55 60 

Lys Trp Ser Phe Gly Lys Val Phe Cys Asn He Tyr Thr S&r Leu Asp 
65 70 75 80 

Val Met Leu Cys Thr Ala Ser He Leu Asn Leu Leu He Ser Leu Asp 
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Gin Val Trp Ser Gly Thr Glu Val Thr Ala Pro Gin Gly Ala Thr Asp 



(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



Gly Asn Gly Lbu Val He Trp Val Ala Gly Phe Arg Met Thr Hie Thr 



Val Thr Thr He Ser Tyr Leu Asn Leu Ala Val Ala Asp Phe Cys Phe 
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Thr Ser Thr Leu Pro Phe Phe Met Val Arg Leu Gly Hie Trp Pro Phe 

50 55 60 

Gly Trp Phe Leu Cys Lys Phe Leu Phe Thr lie Val Asp lie Asn Leu 
65 70 75 80 

Phe Gly Ser Val Phe Leu He Ala Leu He Ala Leu Asp Arg Cys Val 



(2) INFORMATION FOR. SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 304 amino acids 
{B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Asp He Leu Ala Leu Val He Phe Ala Val Val Phe Leu Vai Gly Val 



Leu Gly Asn Ala Leu Val Val Trp Val Thr Ala Phe Glu Ala Lys Arg 
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20 25 30 

Thr lis Asn Ala. lie Trp Phe Leu Asn lie Ala Val Ala. Asp Phe Leu 
35 40 45 

Ser Cys Leu Ala Leu Pro lie Leu Phe Thr Ser He Val Gin His His 
50 55 60 

His Trp Pro Phe Gly Gly Ala Ala Cys Ser He Leu Pro Ser Leu He 
65 70 75 SO 

Leu Leu Aen Met Tyr Ala Ser He Leu Leu Leu Ala Thr He Ser Ala 
85 90 95 



(2) INFORMATION FOR SEQ ID MO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 322 amino acids 

(B) TYPE : amino acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 



Pro Leu Asn He Met Ala He Val Val Phe He Leu Lys Mat Lys Val 
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20 25 30 

Lys Lys Pro Ala Val His lie Ala Thr Ala Asp Val Leu Phe Val Ser 
35 40 45 

Val Leu Pro Phe Lys lie Ser Tyr Tyr Phe Ser Gly Ser Asp Trp Gin 
50 55 60 

Phe Gly Ser Glu Leu Cys Arg Phe Val Thr Ala Ala Phe Tyr Cys Asn 
65 70 75 80 

Met Tyr Ala Ser lie Leu Leu lie Ser lie Asp Arg Phe He Ala Val 
85 90 95 



He Cys Hie Asp Val Leu Asn Glu Thr Leu Leu Glu Gly Tyr Tyr Ala 
145 150 155 160 

Tyr Tyr Phe Ser Ala Phe Ser Ala Val Phe Phe Phe Val Pro Leu He 
1S5 170 175 



Ala Val Ala Asn Arg Ser Lys Lys Ser Arg Thr Asn Arg Cys Phe Asn 
195 200 205 



He Cys Phe Gly Pro Thr Trp Leu Leu He Ala His Tyr Ser Phe Leu 
225 230 235 240 



40 (2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 
IB) TYPE: amino acid 
(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



WO 94/05695 



PCT/US93/08528 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

Tyr He Asn Thr Val He Ser Cys Thr He Phe He Val Gly Trp Gly 
1 5 10 15 

Asn Ala Thr Leu Leu Arg He He Tyr Gin Asn Lya Cys Met Arg Asn 
20 25 30 

Gly Pro Asn Ala Leu He Ala Ser lie Ala Leu Gly Asp Leu He Tyr 

35 40 45 

Val Val He Asp Leu Pro He Asn Val Pro Lys Leu He Ala Gly Arg 
50 55 60 

Trp Pro Phe Glu Gin Asn Asp Phe Gly Val Phe Cys Lys Phe Met Gly 
65 70 75 80 

Val Val Met He Phe Phe Gly Leu Ser Pro Leu Leu Leu Gly Ala Ala 
85 90 95 



(2) INFORMATION FOR SEQ ID N0:3B: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 312 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 
|D> TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

Ixi) SEQUBNCE DESCRIPTION: SEQ ID NO: 38: 

Lys Tyr Phe Val Val He lie Tyr Ala Leu Va.1 Phe Leu Leu Ser Leu 
1 5 * 10 15 

Leu Gly Asn Ser Leu Val Met Leu Val He Leu Tyr Ser Arg Gly Val 
20 25 30 

Arg Ser Val Thr lie Val Tyr Leu Leu Asn lie Ala He Ala Asp Leu 
35 40 45 

Leu Phe Ala Leu Thr Leu Pro He Trp Ala Ala Ser Lys Vai Asn Gly 
50 55 SO 

Trp He Phe Gly Thr Phe Leu Cys Lys Trp Ser Leu Leu Lys Glu Val 
65 70 75 80 

Asn Phe Tyr Ser Gly He Leu Leu Leu Ala Cys He Ser Val Asp Arg 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 39: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 amino acids 

(B) TYPE: amino acid 
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<C) 3TRANDEDNESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Leu Phs Pro lie Val Tyr Ser lie lie Phe Val Leu Gly lie lie Ala 

15 10 15 

ABn Gly Tyr Val Leu Trp Val Phe Ala Arg Leu Tyr Pro Ser Lys Lys 
20 " 25 30 

Asn Glu He Lys He Phe Met Val Asn Leu Thr val Ala Asp Leu Leu 
35 40 45 

Phe Leu He Thr Leu Pro Leu Trp lie Val Tyr Tyr Ser Asn Gin Gly 

50 55 60 

Asn Trp Phe Leu Pro Lys Phe Leu Cys Asn Leu Ala Gly Cys Leu Phe" 
65 70 75 80 

Phe He Asn Thr Tyr Cys Ser Val Ala Phe Leu Gly Val He Thr Tyr 
85 90 95 



Val Asn Pro He Lya Asn 
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325 

(2) INFORMATION FOR SEQ ID »O:40: 
(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 333 amino acids 
<B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Tyr He Asn Thr He Val Ser Cys Leu val Phe Val Leu Gly He He 
15 10 15 

Gly Asn Ser Thr Leu Leu Arg He He Tyr Lys Asn Lys Cys Met Arg 
20 25 30 

Aan Gly Pro Asn He Leu He Ala Ser He Ala Leu Gly Asp Leu Leu 
35 40 45 

His He He He Asp He Pro He Met Ala Tyr Lys Leu He Ala Gly 
50 55 60 

Asp Trp Pro Phe Ala Cys Lys Leu Phe Pro Phe Leu Gin Lys Ser Ser 
65 70 75 80 

Val Gly He Thr Val Leu Asn Leu Cys Ala Leu Ser Val Asp Arg Tyr 
85 90 95 
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(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 

(C) STRARDKDNESS : single 

(D) TOPOLOGY: linear 
<ii) MOLECOLB TYPE: peptide 

ixx) SEQUENCE DESCRIPTION : SEQ ID NO: 41: 

Leu lie Ala Ser Pro Trp Phe Ala Ala Ser Phe Cys Val Val Gly Leu 
1 5 10 15 

Ala Ser Asn Leu Leu Ala Leu Ser Val Leu Ala Gly Ala Arg Gin Ser 

20 25 30 

Ser Ser His Thr Arg Ser Ser Phe Leu Thr Phe LeU Cys Gly Leu val 
35 40 45 

Leu Thr Leu Asp Phe Leu Gly Leu Leu Val Thr Gly Thr II* Val Val 
50 55 60 

Ser Gin His Ala Ala Leu Phe Glu Trp His Ala Val Asp Pro Gly Cys 
65 70 75 80 

Arg Leu Cys Arg Leu val Pro Phe lie Gin Lys Ala Ser Val Gly lie 
85 90 95 



Leu Val Leu Asp Tyr lie Gly lie Asn Met Ala Ser lie Asn Ser Cys 
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10 (2) INFORMATION FOR SEQ ID NO:42: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDKESS : Single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 42: 

lie Tyr Val lie Pro Ala Val Tyr Gly Leu He He Val He Gly Leu 
1 5 10 15 

20 He Gly Asn He Thr Leu He Lys He Phe Cys Thr Val Lyj Ser Leu 

20 25 30 

Asn Leu Phe He Ser Ser He Ala Leu Gly Asp Leu Leu Leu Leu val 
35 40 45 

Thr He Cys Ala Pro Val Asp Ala Ser Lys Tyr He Ala Asp Arg Trp 
25 50 55 60 

Leu Phe Gly Arg He Gly Cys Lys Leu He Pro Phe He Gin Leu Thr 
65 70 75 80 

Ser Val Gly Val Ser Val Phe Thr Leu Thr Ala Leu Ser Ala Asp Arg 
85 90 95 
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Phe Ala Leu Tyr Leu Leu Ser Lye Ser Phe Arg Gin Phe Asn Thr Gin 
275 280 2S5 

Leu Leu Cys Cys Gin Pro Gly Leu Ser His Ser Thr Gly Arg Ser Leu 
290 295 300 

Ser Phe Lye Ser Thr Asn Pro Ser Ala Thr Phe Ser Leu He Asn Arg 



(2) INFORMATION FOR SEQ ID NO.-43: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS i single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Cys Val He Pro Ser Ser Leu Tyr Leu He He He Ser Val Gly Leu 
IS 10 15 

Leu Gly Aan He Met Leu Val Lys He Phe Leu Thr Asn Ser Thr Met 
20 25 30 

Arg Ser Val Pro Asn He Phe He Ser Asn He Ala Ala G1-* Asp Leu 
35 40 45 

Leu Leu Leu Leu Thr Cys Val Pro val Asp Ala Ser Arg Tyr Phe Phe 
50 55 60 

ABp Glu Trp Val Phe Gly Lys Leu He Gly Cys Lys Leu He Pro Ala 
55 70 75 80 

He Gin Leu Thr Ser Val Gly Val Ser Val Pro Thr Leu Thr Ala Leu 
85 90 95 



Glu His Thr Lys Lys Gin Met Glu Thr Arg Lys Arg Leu Ala Lys He 
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val Leu Val Phe val Gly Cys Phe Val Phe Cys Trp Phe Pro Asn His 



Arg Ser Thr Ser Tyr Leu Leu Ser Ser Ser Ala Val Trp Arg Ser Lau 



(2> INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE : amino acid 

(C> STRANDEDNESS : single 
(D) TOPOLOGY i linear 
(ii) MOLECULE TYPE: peptide 



10 15 

Leu Ala Asn Ser Val Val Val Trp Val Asn He Gin Ala Lys Thr Thr 
20 25 30 

Gly Tyr Asp Thr Hie Cys Tyr lie Lau Asn Leu Ala He Ala Asp Leu 
35 40 45 

Trp Trp Leu Thr lie Pro Val Trp Trp Ser Leu Val Gin His Asn Gin 
50 55 60 

Trp Pro Met Gly Glu Leu Thr Cys Lys Val Thr His Leu lie Phe Ser 
65 70 75 80 

He Asn Leu Phe Ser Gly He Phe Phe Leu Thr Cys Met Ser Val Asp 
85 90 95 
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(2) I NFORMATION FOR SEQ ID 110:45: 
{i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 353 amino acide 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii> MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 45: 

Lys Val Leu Val Thr Ala lie Tyr Leu Ala Leu Phe Val Va;. Gly Thr 
15 10 15 

Val Gly Asn Ser Val Thr Ala she Thr Leu Ala Arg Lys Lys Ser Leu 
20 25 30 

Gin Ser Leu Gin Ser Thr Val Hie Tyr Hie Leu Ser Sar Leu Ala Lsu 
35 40 45 

Ser Asp Leu Leu He Leu Leu Trp Val Glu Leu Tyr Asn Phe He Trp 
50 55 60 

His Hie Pro Trp Ala Phe Gly Asp Ala Gly Cys Arg Gly Tyr Tyr. Phe 
65 70 75 80 

Leu Arg Asp Ala Cys Thr Tyr Ala Thr Ala Leu Asn val Ala Ser Leu 
85 90 95 



He Val Asp Thr Ala Thr Val Lya Val Val He Gin Val Asn Thr Phe 
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(2) INFORMATION FOR SEQ ID NO: 46: 
(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : single 
(P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Glu Asn lie Phe Val Leu Ser Val the Cys Leu His Lys Thr Asn Cys 
20 25 30 

Thr Val Ala Glu lie Tyr Leu Gly Asa lie Ala Ser Ala Asp Leu He 
35 40 45 

He Ala Cys Gly Leu Pro Phe Trp Ala He Thr He Ala Asn Asn Phe 
50 55 60 

Asp Trp Leu Phe Gly Glu Val Leu Cys Arg Val Val Asn Leu Tyr Met 
65 70 75 80 

Asn Leu Tyr Ser Ser He Cys Phe Leu Val Ser He Asp Arq Tyr Leu 
85 90 95 
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Val Tyr Pro Ser Arg Ser Trp Glu Val Phe Leu Leu Asn Leu Val Gly 
145 150 155 ISO 



Val Leu Arg Asn Asn Glu Met Lys Lys Phe Lys Glu Val Gin Thr Glu 
ISO 185 190 



Leu Gly Val Leu Ser Gly Cys Trp Asn Glu Arg Ala Val Asp lie Val 
225 230 235 240 

Arg Gin He Ser Ser Tyr Val Ala Tyr Ser Asn Ser Cys Leu Asn Pro 
245 2S0 255 



He His Lys Leu Gin Asp Trp Ala Gly Asn Lys Gin 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 347 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

He Leu Leu Val Val He He Cys Gly Leu Gly He Val Gl> Asn He 
IS 10 15 

Met Val Val Leu Val Val Met Arg Thr Thr Pro Thr Asn Cys Tyr Leu 
20 25 30 

Val Ser He Ala Val Ala Asp Leu Met Val Leu Val Ala Ala Gly Leu 
35 40 45 

Pro Asn He Thr Asp Ser He Tyr Gly Ser Trp Val Tyr Gly Tyr Val 
50 55 ' 60 

Gly Cys Leu Cys He Thr Tyr Leu Gin Tyr Leu Gly He Asn Ala Ser 
65 70 75 80 

Ser Cys Ser He Thr Ala Phe Thr He Glu Arg Tyr He Ala He Cys 
85 90 95 

His Pro He Lys Ala Gin Phe Leu Cys Thr Phe Ser Arg Ala Lys Lys 
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- 102 - 
105 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 341 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



10 15 

Thr Gly Asn Ala lie Val lie Trp lie lie Leu Ala His Arg Arg Met 
20 25 30 

Arg Thr Val Thr Aen Tyr Phe He Val Asn He Ala Leu Ala Asp Leu 
35 40 45 

Leu Asn Ala. Ala Phe Asn Phe Val Tyr Ala Ser His Asn lis Trp Tyr 
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50 55 60 

Phe Gly Arg Ala Phe Cys Tyr Phe Gin Asn Leu Phe Pro lie Thr Ala 
65 70 75 80 

Met Phe Val Ser lie Tyr Ser Met Thr Ala lie Ala Ala Asp Arg Tyr 



(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

He Val Leu Trp Ala Ala Ala Tyr Thr Val He Val Val Arg Ser Val 
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Val Gly Asn Val Val Val lie Trp lie lie Leu Ala His Lys Arg Met 
20 25 30 

Arg Thr Val Thr Asn Tyr Phe Leu Val Asn lie Ala Phe Ala Phe Ala 
35 40 45 

Leu Asn Thr Trp Asn Phe Thr Tyr Ala Val Hie Asn Val Trp Tyr Tyr 

50 55 60 

Gly Leu Phe Tyr Cye Lys Phe His Asn Phe Phe Pro lie Ala Ala Leu 
S5 70 7S SO 

Phe Ala Ser He Tyr Ser Met Thr Ala Val Ala Phe Asp Arg Tyr Leu 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 



WO 94/05695 



PCT/US93/08528 



(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

He Ala Leu Trp Ser Leu Ala Tyr Gly Leu- Val Val Ala Val Ala Val 
1 5 10 15 

Phe Gly Asn Leu He Val He Trp He He Leu Ala His Lys Airg Met 
20 25 30 

Arg Thr Val Thr Asn Tyr Phe Leu Val Asn Leu Ala Phe Ser Asp Ala 
35 40 45 

Ser Val Ala Ala Phe Asn Thr Leu He Asn Phe He Tyr Gly Leu His 
50 55 60 

Ser Glu Trp Tyr Phe Gly Ala Asn Tyr Cys Arg Phe Gin Asn Phe Phe 
65 70 75 80 

Pro He Thr Ala Val Phe Ala Ser He Tyr Ser Met Ala He Ala Val 
85 90 95 



Cys Tyr Val Trp Pro Glu Gly Pro Lys Gin His Phe Thr Tyr His He 
145 150 155 160 



Cys Asp Lys Tyr His Glu Gin Leu Lys Ala Lys Arg Lys Val Val Mat 
195 200 205 



Tyr Phe He Leu Thr Ala He Tyr Gin Gin Leu Asn Arg Trp Lys Tyr 

225 230 235 240 

He Gin Gin Val Tyr Leu Ala Ser Phe Trp Leu Ala Met Ser Ser Thr 

245 2S0 255 



Ser Ser Leu Tyr Thr Val Ser Phe Met Ser Val Thr Val Leu Phe Asp 

305 310 315 320 

Pro Asn Asp Gly Asp Pro Thr Lvs Ser Ser Arg Lys LyB Arg Ala Val 

325 330 335 
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(2) INFORMATION FOR SEQ ID HO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:51: 

Met lie Pro Thr Leu Tyr Ser lie lie Phe Val Val Gly He Phe Gly 
15 10 15 

Asn Ser Leu Val Val lie Val lie Tyr Phe Tyr Met Lys Leu Lys Thr 
20 25 30 

Tyr Ala Ser Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe 
35 40 45 

Leu Leu Thr Leu Pro Leu Trp Ala val Tyr Thr Leu Tyr Arg Trp Pro 
50 55 60 

Phe Gly Asn Tyr Leu Cys Lys He Ala Ser Ala Ser Val Ser Phe Asn 
65 70 75 80 

Leu Tyr Ala Ser Val Phe Leu Leu Thr Cys Leu Ser He Asp Arg Tyr 
85 90 95 

Leu Ala He Val His Pro Met Lys Ser Arg Leu Arg Arg Leu Val Ala 
100 105 110 



Leu Gly Leu Thr Lys Asn He Leu Gly Phe Leu Phe Pro Phe Leu He 
165 170 175 

He Leu Thr Ser Tyr Thr Leu He Trp Lys Thr Leu Lys Lys Ala Tyr 
180 185 ISO 



Phe Thr Phe Met Val Leu He Gin Leu Gly Leu He Arg Asp Cys Lys 
225 230 235 240 



Tyr Phe Gin Gin Asn Leu Asn Pro Leu Phe Tvr Gly Phe Leu Gly Lys 
260 265 270 
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(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 282 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

He Val Hie Trp Val He Met Ser He Ser Pro Val Gly Phe Val Glu 
15 10 15 

Asn Gly He Leu Leu Trp Phe Leu Cys Phe Phe Thr Val Tyr Thr His 
20 25 30 

Leu Ser He Ala Asp He Ser Leu Leu Phe Cys He Phe He Leu Ser 
35 40 45 

He Asp Tyr Ala Leu Asp Tyr Glu Leu Ser Ser Gly His Tyr Tyr Thr 
50 55 60 

He Val Thr Leu Ser Val Thr Phe Leu Phe Gly Tyr Asn Thr Gly Leu 
65 70 75 80 

Tyr Leu Leu Thr Ala He Ser Val Glu Arg Cys Leu Ser Val Leu Tyr 
85 90 95 



Cys Asn Thr val Thr Val Glu Thr val Val 
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275 280 

(2) INFORMATION FOR SEQ ID SO: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDHESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Tyr Asp Phe Leu Arg Val Leu He Trp Leu He Asn He Lew Ala He 
15 10 IS 

Met Gly Asn Val Met Thr Leu Phe Val Leu Leu Thr Ser Arg Tyr Lys 
20 25 30 

Leu Thr Val Pro Arg Phe He Met Asn Leu Ser Phe Ala Asp Phe Cys 
35 40 45 

Met Leu Tyr Leu Leu Leu lie Ala Ser Val Asp Ser Gin Thr Lys Gly 
50 55 60 

Gin Tyr Tyr Asn His Ala He Asp Trp Gin Thr Gly Ser Gly Cys Ser 
65 70 75 80 

Thr Ala Gly Phe Phe Thr Val Leu Ala Ser Glu Leu Ser Val Tyr Thr 
85 90 95 



Val Ser Asn Tyr Met Lys Val Ser He Cys Leu Pro Met Val Glu Thr 
145 150 155 0.60 

Thr Leu Ser Gin Val Tyr He Leu Thr He Leu He Leu Asn Val Val 
165 170 175 



AXa He Ser Ala Ala Phe Lys Val Pro Leu He Val Thr Asn Ser Lys 
225 230 235 240 

Val Leu Leu Val Leu Phe Tyr Pro He Asn Ser Cys Ala Asn Pro Phe 
245 250 255 

Leu Tyr Ala He Phe Thr Lys Thr Phe Gin Arg Asp Phe Phe He Leu 
260 2G5 270 



WO 94/05695 



PCI7US93/08528 



(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 36 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Tyr Lys Phe Leu Arg lie Val Val Trp Phe Val Ser Leu Leu Ala Leu 
1 5 10 15 

Leu Gly Asn Val Phe Val Leu Leu He Leu Leu Thr Ser His Tyr Lys 
20 25 30 

Leu Asn Val Pro Arg Phe lie Met Asn He Ala Phe Ala Asp Phe Cys 
35 40 45 

Met Het Tyr Leu Leu Leu lie Ala Ser Val Asp Leu Tyr Thr His Ser 
50 55 60 

Glu Tyr Tyr Asn His Ala He Asp Trp Gin Thr Gly Pro Gly Cys Asn 
65 70 75 80 

Thr Ala Gly Phe Phe Thr Val Phe Ala Ser Glu Leu Ser Val Tyr Thr 
85 90 95 
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(2) INFORMATION FOR SEQ ID HO: 55: 
(i) SEQUENCE CHARACTERISTICS : 

{A> LENGTH: 327 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLE COLE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Tyr Asn He Leu Arg Val Leu He Trp Phe He Ser He Leu Ala He 
15 10 15 

Thr Gly Asn He He Val Leu Val He Leu Thr Thr Ser Gin Tyr Lye 
20 25 30 

Leu Thr Val Pro Arg Phe Leu Met Asn He Ala Phe Ala Asp Leu Cys 
35 40 45 

He Gly He Tyr Leu Leu Leu He Ala Ser Val Asp He His Thr Lys 
50 55 60 

Ser Gin Tyr His Asn Tyr Ala He Asp Trp Qln Arg Gly Ala Gly Cys 
65 70 75 80 

Asp Ala Ala Gly Phe Phe Thr Val Phe Ala Ser Glu Leu Ser Val Tyr 
85 50 95 



Met Gin He Asp Cys Lys Val Gin Leu Arg His Ala Ala Ser Val Met 
115 120 125 



Gly He Ser Ser Tyr Met Lys Val Ser He Cys Leu Pro Leu He Asp 
145 150 155 160 



Ala He Ser Ala Ser Leu Lys VaL Pro Leu He Thr Val Ser Lys Ala 
225 230 235 240 

Lys He Leu Leu Val Leu Phe His Pro He Asn Ser Cys Ala Asn Pro 
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(2) INFORMATION FOR SEQ ID NO: 56: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 
(B> TYPE: amino acid 
(C) STRANDEDNESS : Single 
(D> TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ser Mat Lau Ala Ala Tyr Met Phe Leu Leu lie val Leu Gly Phe Pro 

15 10 IS 

lie Asn Phe Leu Thr Leu Tyr Val Thr Val Gin His Lys Lys Leu Arg 
20 25 30 

Thr Pro lie Asn Tyr lie Leu Leu Asn Leu Ala val Ala Asp Leu Phe 
35 40 45 

Mat Val Leu Gly Gly Phe Thr Ser Thr Leu Tyr Thr Ser Leu His Gly 
50 55 60 

Tyr Phe Val Phe Sly Pro Thr Gly Cys Asn Leu Glu Gly Ph« Phe Ala 
65 70 75 30 

Thr Leu Gly Gly (Jlu He Ala Leu Trp Ser Leu Trp Leu Ala He Glu 
85 90 95 



Cys Ala Ala Pro Pro He Ala Gly Trp Ser Arg Tyr He Pro Glu Gly 
130 135 140 



Asn Asn Glu Ser Phe Val He Tyr Met Phe Val Val His Phe Thr He 
165 170 175 
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Gly Ser Asn Phe Gly Pro He Phe Met Arg He Pro Ala Phe Phe Ala 
245 250 255 



(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met He Phe Val Val He Ala Ser Val Phe Thr Asn Gly Leu Val Leu 
15 10 15 

Ala Ala Thr Met Lys Phe Lys Lys Leu Pro His Pro He Asn Trp He 
20 25 30 

Leu val Aen Leu Ala Val Ala Asp He Ala Gly Thr Val He Ala Ser 
35 40 45 

Thr He Ser Val Val Asn Gin Val Tyr Gly Tyr Phe Val Leu Gly His 
50 55 60 

Pro Met Cys Val Leu Glu Gly Tyr Thr Val Ser Leu Cys Gly He Thr 
S5 70 75 80 

Gly Leu Trp Ser Leu Ala He He Ser Trp Glu Arg Trp Met Val Val 
85 90 95 



Gly Pro Asp Val Phe Ser Gly Ser Ser Tyr Pro Gly Val Gin Ser Leu 
145 150 1SS 160 



Ala Phe Cys Phe Cys Trp Gly Pro Tyr Ala Phe Phe Ala Cys Phe Ala 
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Lys Val Asp Asp GLy Ser Glu Leu Ser Ser Ala Ser Lya Thr Glu Val 



(2) INFORMATION FOR SBQ ID NO: 58: 
(1) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 297 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



15 

Ala Ala Thr Met Lys Phe Lys Lys Leu Arg His Pro Leu Asn Trp lie 
20 25 30 

Leu Val Asn He Ala Val Ala Asp He Ala Gly Thr Val He Ala Ser 
35 40 45 

Thr He Ser He Val Asn Gin val Ser Gly Tyr Phe Val Leu Gly His 
50 55 SO 

Pro Mat Cys Val Leu Glu Gly Tyr Thr Val Ser Leu Cys Gly He Thr 
S5 70 75 SO 

Gly Leu Trp Ser Leu Ala He He Ser Trp Glu Arg Trp Leu Trp Cys 
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(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : Linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Gin Ala Ala Phe Met Gly Thr Val Phe Leu lie Gly Phe Pro Leu Leu 
15 10 15 

Val Ala Thr Leu Ala Tyr Lys Lys Leu Arg Gin Pro Asn Tyr lie Leu 
20 25 30 

Val Asn Val Ser Phe Gly Gly Phe Leu Leu Cys He Phe Ser Val Phe 
35 40 45 

Pro Val Phe Val Ala Ser Cys Asn Gly Tyr Phe Val Phe Gly Arg His 
50 55 60 

Val CyB Ala Leu Glu Gly Phe Leu Gly Thr Val Ala Gly Leu Val Thr 
65 70 75 80 

Gly Trp Ser Leu Ala Phe Leu Ala Phe Glu Arg Tyr lie Val He Cys 
85 90 95 



Pro Asp Lys Tyr Thr Val Gly Thr Lys Tyr Arg Ser Glu Ser Tyr Thr 
145 150 155 160 



Ala Ala Phe Ala Met Tyr Met Val Asn Aan Arg Asn His Gly Leu Asp 
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(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Leu He Tyr Gly Leu Phe Leu Ser Met Tyr Leu Val Thr Val He Gly 
15 10 15 

Abb He Ser He He val Ala He He Ser Asp Pro Cys Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Asn Leu Ser Phe Val Asp He Cys Phe 
35 40 45 

He Ser Thr Thr Val Pro Val Asn Thr Gin Thr Gin Asn Asn Val He 
50 55 60 

Thr Tyr Ala Gly Cys He Thr Gin He Tyr Phe Phe Leu Leu Phe val 
65 70 75 80 

Glu Lau Asp Asn Phe Leu Leu Thr He Met Ala Tyr Asp Arg Tyr Val 
85 90 95 
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Pro Pro Ser Leu Leu His Phe Phe Leu Val Leu Cys His Leu Pro Cys 



(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Leu Leu Phe Leu Leu Phe Leu lie Met Tyr Leu Ala Thr Val Leu Gly 
15 10 15 

Asn Leu Leu lie lie Leu Ala lie Gly Gly Asp Ser Arg Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Asn Leu Ser Phe Val Asp Val Cys Phe 
35 40 45 

Ser Ser Thr Thr Val Pro LyB Val Leu Ala Asn His lie Leu Gly Ser 
50 55 60 

Gin Ala lie Ser Phe Ser Gly Cys Leu Thr Gin Leu Tyr Phe Leu Ala 
65 70 75 80 

Val Phe Gly Asn Met Asp Asn Phe Leu Leu Ala Val Met Ser Tyr Asp 



Thr Val He Ala Val Tyr Phe Asn Pro Ser Ser Ser His Leu Ala Gly 
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(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY : linear 
(it) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Leu Leu Phe Leu Leu Phe Leu Val Met Tyr Leu Leu Thr Val Val Gly 
1 5 10 15 

Asn Leu Ala lie lie Ser Leu Val Gly Ala His Arg Cys Leu Gin Pro 
20 25 30 

His Thr Pro Met Tyr Phe Phe Leu Cys Asn Leu Ser Phe Leu Glu He 
35 40 45 

Trp Phe Thr Thr Ala Cys Val Pro Lys Thr Leu Ala Thr Phe Ala Pro 
50 55 60 

Arg Gly Gly Val He Ser Leu Ala Gly Cys Ala Thr Lys Tyr Phe Val 
65 70 75 80 

Phe Ser Leu Gly Cys Thr Glu Tyr Phe Leu Leu Ala Val Met Ala Tyr 



Lye Ala' He Thr Val Leu Asn Thr He Val Thr Pro Val Leu Asn Pro 
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250 



(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



10 15 

Assn Leu Leu lie He Met Ala He lie Thr Gin Ser Hie Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Phe val Asp He Cys Phe Thr Ser Thr 
35 40 45 

Thr lie Pro Leu Val Asn He Tyr Thr Gin Ser Lys Ser He Thr Tyr 
50 55 SO 

Glu Asp Cys He Ser Leu Val Phe Ala Glu Leu Gly Asn Phe Leu Leu 
65 70 75 B0 

Ala val Met Ala Tyr Asp Arg Tyr Val Ala Xaa Cys His Pro Leu Cys 
85 90 95 



WO 94/05695 



PCTVUS93/08528 



(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Phe Tyr Ala Leu Phe Leu Val Met Tyr Leu Thr Thr He Leu Gly 
1 5 10 15 

Asn Leu Leu He He Val Leu Val Gin Leu Asp Ser Gin Leu His Thr 
20 25 30 

Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe 
35 40 45 

Ser Ser Leu Lys Leu Leu Gin Asn Met Arg Ser Gin Asp Thr Ser He 
50 ' 55 60 

Pro Tyr Gly Gly Cys Leu Ala Gin Thr Tyr Phe Phe Met Val Phe Gly 
65 70 75 80 

Asp Leu Ser Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val Ala 
85 90 95 



45 (2) INFORMATION FOR SEQ ID NO: 65: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2B6 amino acids 

(B) TYPE : amino acid 
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fC> STRANDEDNESS : single 
(D> TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Leu Leu Phe Phe Leu Ser Leu Leu Xaa Tyr Val Leu Val Leu Thr Glu 
1 5 10 15 

Asn Met Leu lie lie lie Ala lie Arg Asia Hie Pro Thr Leu His Lys 
20 25 30 

Pro Met Tyr Phe Phe Leu Phe Leu Glu lie Txp Tyr Val Thr Val Thr 
35 40 45 

lie Pro Lys Leu Met Gly Phe He Gly Ser Lys Glu Asn His Gly Gin 
50 55 60 

Leu He Ser Phe Phe Ala Cys Met Thr Gin Leu Tyr Phe Phe Leu Gly 
G5 70 75 80 

Leu Gly Cys Thr Glu Cys Val Leu Leu Ala Val Met Ala Tyr Asp Arg 
85 90 95 



(2) INFORMATION FOR SEQ ID 110:66: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 
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1 5 10 .15 

Asn Leu Leu lie Val Val Leu Val Gin Leu Asp Ser His Leu His Thr 
20 25 30 

Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe 
35 40 45 

Ser Ser Val Thr Met Leu Lys Leu Leu Gin Asn lie Gin Ser Gin Val 
50 55 60 

Pro Ser He Ser Tyr Ala Gly Cys Leu Trp He Phe Phe Phe Leu Leu 
65 70 75 80 

Phe Gly Tyr Leu Gly Asn Phe Leu Leu Val Ala itet Ala Tyr Asp Arg 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 67: 
U) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 2B4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNHSS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Leu Phe Tyr Ala Leu Phe Leu Ala Met Tyr Leu Thr Thr Leu Leu Gly 
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Asn Leu He He He He Leu He Leu Leu Asp Ser His Leu His Thr 
20 25 30 

Pro Met Tyr Leu Phe Leu Ser Asa Leu Ser Phe Ala Asp Leu Cys Phe 
35 40 45 

Ser Ser Leu Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser He 

50 55 60 

Pro Tyr Ala Gly Cys Leu Ala Gin lie Tyr Phe Phe Leu Phe Phe Gly 
65 70 75 80 

Asp Leu Gly Asn Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val 
85 90 95 



Glx Val Ser Leu Val Val Leu Ser Trp Val Leu Thr Thr Phe His Ala 
115 120 125 



Val He Pro His Tyr Phe Cys Asp Met Ser Thr Leu Leu Lye Val Ala 

145 150 155 160 

Cys Ser Asp Thr His Asp Asn. Glu Leu Ala He Phe He Leu Gly Gly 

165 170 175 



He Val Ser Ser He Phe Lys Val Pro Ser Ser Gin Ser He His Lys 
195 200 205 



Gly Thr Val He Gly Leu Tyr Leu Cys Pro Ser Ala Asn Asn Ser Glu 
225 230 235 240 

Val Lys Glu Thr Val Met Ser He Tyr Thr Met Val Pro Met Leu Asn 
245 250 255 

Pro Phe He Tyr Ser Leu Arg Asn Arg Asp He Lys Asp Ala Leu Glu 
2S0 265 270 



35 (2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 
4 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Leu Phe Tyr Ala Leu Phe Leu Ala Met Tyr Leu Thr He He Leu Gly 
1 5 10 15 

45 Asn Leu Leu He He Val Leu Val Arg Leu Asp Ser His Leu His Met 



WO 94/05695 



PCT/US93/08528 



- 12 3 - 

Tyr Leu Phe Leu Ser Asn Leu Ssr Phe Ser Asp Leu Cys Phe Ser Ser 
35 40 " 45 

Val Thr Trp Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser He 
50 55 £0 

Ser Tyr Thr Gly Cys Leu Thr Gin Leu Tyr Phe Phe Met Val Phe Gly 

65 70 75 80 

Asp Trp Ser Phe Leu Leu Val Val Met Ala Tyr Asp Arg Tyr Val Ala 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 69: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Val Phe Tyr Ala Leu Phe Leu Ser Met Tyr Leu Thr lie Vai Leu Gly 
15 10 15 

Asn Leu He He He He Leu He His Leu Asp Ser His Leu His Thr 



45 Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe 

35 40 45 
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- 124 - 

Ser Ser Leu Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser lie 
50 55 60 

Pro Phe Ala Gly Cys Leu Thr Gin Leu Tyr Phe Tyr Leu Tyr Phe Ala 
65 70 75 80 

Asp Leu Glu Ser Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val 



Cys Leu 

(2) INFORMATION FOR SKQ ID NO: 70: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii> MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Leu Ala lie Ala Val Leu Ser Leu Thr Leu Leu Gly Thr Phe Thr Val 
15 10 15 

Leu Glu Asn Leu Leu Val Leu Cys Val lie Leu His Ser Arg Ser Leu 
20 25 30 

Arg Cys Arg Pro Ser Tyr His Phe lie Gly Ser Leu Ala Val Ala Asp 
35 40 45 

Leu Leu Gly Ser Val lie Phe Val Tyr Ser Phe Val Asp Phe His Val 
50 55 60 

Phe His Arg Lys Asp Ser Pro Asn Val Phe Leu Phe Lys Leu Gly Gly 
65 70 75 ~ 80 
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Val Thr Ala Ser Phe Thr Ala Ser Val Gly Ser Leu Phe Leu Thr Ala 



35 (2) INFORMATION FOR SEQ ID NO:71 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) length: 349 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Lyg Ala Leu Leu lie Val Ala Tyr Ser Phe Thr He Val Phe Ser Leu 
15 10 15 

45 Phe Gly Asn Val Leu Val Cys His Tyr lie Phe Lys Asn Gin Arg Lye 
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Ser Leu Ala Ser Leu lie Pro Cys Thr Leu Leu Thr Ala Cys Phe Tyr 
65 7D 75 80 

Val Ala lie Thr Ala Ser Leu Cys Phe He Thr Glu He Ala Leu He 
85 90 95 



Val Glu Leu Met Leu Gly Ala Phe Val lis Pro Leu Ser Val He Ser 
165 170 175 



Lye Leu Leu Lys Trp He Ser Ser Ser Cys Glu Phe Glu Arg Ser Leu 
225 230 235 240 

Lye Arg Ala Leu He Leu Thr Glu Ser Leu Ala Phe Cys His Cys Cys 
245 250 255 



(2) INFORMATION FOR SEQ ID NO: 80: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Asp Asp Asp Asp Asn He Trp Ser He Phe Asp Trp He Gly Tyr Leu 
1 5 10 " 15 

Asn Ser He Ser Met Val He Tyr Thr Leu Phe Lys Lys Lys Lys 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 81: 
U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



lie Phe Thr lie Ala Leu Ala Tyr Gly Ala Val He He Leu GLy Val 
15 10 15 

Ser Gly Asn Leu Ala Leu He He He He Leu Lys Gin Lys Qlu Leu 
20 25 30 

He Leu He Val Asn Leu Ser Phe Ser Asp Leu Leu Val Ala Val Trp 
35 40 45 

Lev Pro Phe Thr Phe Val Tyr Thr Leu He Cys His Trp Val Phe Gly 
50 55 £0 

Glu Cys Cys Lya Leu Asn Pro Phe Val Gin Cys Val Ser He Thr Val 
65 70 75 ao 

Ser He Phe Ser Leu Val Leu He Ala Val Glu Arg His Gl». Leu He 



Cys Asp Phe Arg Ser Arg Asp Gly Arg Thr Thr Arg Leu 
290 295 300 

40 (2) INFORMATION POP- SEQ ID NO: 73: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 
IB) type: amino acid 
(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 73: 

Leu Thr Ser Val Val Phe lie Leu lie Cys Cys Phe lie He Leu Glu 
1 5 io 1S 

Asn He Phe Val Leu Leu Thr He Trp Lye Thr Lys Lys Phe His Arq 
20 25 30 

Pro Met Tyr Tyr Phe lie Gly Asn He Ala Leu Ser Asp Leu lie Ala 
35 40 45 

Gly val Ala Tyr Thr Ala Asn Leu Leu Leu Ser Gly Ala Thr Thr Tvr 
50 55 SO 

Lys Leu Thr Pro Ala Gin Trp Phe Leu Arg Glu Gly Ser Met Phe Val 
65 ™ 75 ao 

Ala Leu Ser Leu Ser Val Phe Ser Leu Leu Ala He Ala He Glu Arg 
85 90 95 

Tyr He Thr Met Leu Lys Met Leu His Asn Gly Ser Asn Asn Phe Arg 
100 105 no 



Ser Thr Val Leu Pro Leu Tyr His Lys His Tyr He Leu Phe Cys Thr 
145 150 155 160 

Leu Ha Val Phe Thr Leu Leu Leu Leu Ser He Val He Leu Tyr Cys 
IfiS ivo !7 5 Y 

Arg He Tyr Ser Leu Val Arg Thr Arg Ser Arg Arg Leu Thr Phe Arq 
180 IBS 190 

Lys Asn He Ser Lys Ala Ser Arg Ser Ser Glu Asn Val Ala Leu Leu 
I 95 200 205 

Lys Thr Val lie He Val Leu Ser val Phe lie Ala Cys Trp Ala Pro 
210 215 220 

Leu Phe He Leu Leu Leu Leu Asp Val Gly Cys Lys Val Lye Thr Cys 
225 230 235 240 

Asp He Leu Phe Arg Ala Glu Tyr Phe Leu Val He Ala Val He Asn 
245 250 255 

Ser Gly Thr Asn Pro He He Tyr Thr Leu Thr Asn Lys Glu Met Arg 
260 265 270 

Arg Ala Phe He Arg He Met Cys Cys Lys Cys Pro Ser Gly Asp Ser 
2? 5 2B0 285 

Ala Gly Lys Phe Lys Arg Pro He He Ala Gly Met Glu Phe Ser Arq 
290 295 300 

Ser Lys Ser Asp Asn Ser Ser His Pro Gin Lys Asp Glu Gly Asp Asn 
305 31° 315 320 

Pro Glu Thr He Met Ser Ser Gly Asn Val Asn Ser Ser Ser 
325 330 

(2) INFORMATION FOR SEQ ID NO: 74: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 
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IB) TYPE: amino acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

lie Thr Tyr Tyr He Leu He Gly Lev Cys Ala Val Val Gly Asn He 
15 10 15 

Leu Leu Val He Trp Val Val Lys Leu Asn Arg Thr Leu Arg Thr Thr 
20 25 30 

Thr Phe Tyr Phe He Val Ser He Ala Leu Ala Asp He Ala Val Leu 
35 40 45 

Val He Pro Leu Ala He Ala Ser Ala Trp Arg Ser Arg Cys Thr Ser 
50 55 60 

Asn Cys Leu Phe Met Ser Cys Val Leu Leu val Phe Thr His Ala Ser 
55 70 75 30 

He Met Ser Leu Leu Ala He Ala Val Asp Arg Tyr Leu Arg Val Lys 
85 90 95 



Thr Leu Arg Glu Phe Lys Thr Pro Lys Ser Leu Phe Leu Val Leu Phe 
145 150 155 ISO 

Leu Phe Ala Leu Cys Trp Leu Pro Leu Ser He He Asn Phe Val Ser 
165 . 170 175 

Tyr Phe Asn Val LyB He Pro Glu Thr Leu Leu Gly He Leu Leu Ser 
180 185 190 



Glu Thr Tyr Phe Val He Leu Arg Ala Cys Arg Leu Cys Gin Thr Ser 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 75: 
(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii> MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala He Leu He Ser Phe He Tyr Ser Trp Cys Leu Val Gly Leu Cys 
1 5 10 is 

Gly Asn Ser Met Val He Tyr Val He Leu Arg Tyr Ala Ly Met Lys 
20 25 30 

Thr Ala Thr Asn He Tyr He Leu Asn He Ala He Ala Asp Glu Leu 
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Leu Val Pro Phe Leu Val Thr Ser Thr Leu Leu Arg His Trp Pro Phe 

50 -55 SO 

Gly Ala Leu Leu Cys Arg Leu Val Leu Ser Val Asp Ala Val Asn Met 
65 70 75 ao 

Phe Thr Ser lie Tyr Cye Leu Thr Val Leu Ser Val Asp Arg Tyr Val 
85 90 95 

Ala Val Val His Pro He Lys Ala Ala Arg Tyr Arg Arg Pro Thr Val 
100 105 no 

Ala Lye Val Val Asn Leu Gly Val Trp Val Leu Ser Leu Leu Val He 
115 120 125 

Leu Pro He Trp Phe Ser Arg Thr Ala Ala Asn Ser Asp Gly Thr Val 
130 135 140 

Ala Cys Asn Met He Trp Glu Pro Ala Gin Phe Trp Leu Va_ Gly Phe 
145 ISO 155 160 

Val Leu Tyr Thr Phe Leu Met Phe Leu Leu Pro Val Gly Ala He Cvs 
165 170 175 

Leu Cys Tyr Val Leu He He Ala Lys Met Arg Met Val Ala Leu Lys 
180 185 190 

Ala Gly Trp Gin Gin Arg Lys Arg Ser Glu Arg Lys He Thr Leu Val 
195 200 205 

Met Met Val Val Met Val Phe Val He Cys Trp Phe Tyr Val Val Gin 
210 215 220 

Leu Val Asn Val Phe Ala Glu Gin Asp Asp Ala Thr Val Ser Gin Leu 
225 230 235 240 

Ser Val He Leu Gly Tyr Ala Asn Ser Cys Ala Asn Pro He Leu Tyr 
245 250 255 

Gly Phe Leu Ser Asp Asn Phe Lys Arg Ser Phe Gin Arg He Leu Cvs 
260 265 270' 

Leu Ser Leu Asn Ala Ala Glu Glu Pro Val Asp Tyr Tyr Ala Thr Ala 
275 280 285 

Leu Lys Ser Arg Ala Tyr Ser Val Glu Asp Phe Gin Pro Glu Asn Leu 
290 295 300 

Glu Ser Gly Gly Val Phe Arg Asn Cys Thr Cys Ala Ser Arg He Ser 
305 310 315 320 



(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 298 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

<xi} SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Val Thr Aen Tyr He Phe Leu Leu Leu Cys Leu CyB Gly Lea Val Glv 
1 5 10 is 
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Asn Gly Leu Val Leu Trp Phe Phe Gly Phe Ser lie Lye Arg Thr Pro 
20 25 30 

Phe Ser He Tyr lie Tyr Phe Leu His He Ala Ser Ala Asp Gly He 
35 40 45 

Tyr Leu Phe Ser Lys Ala Val lie Ala Leu Leu Asn Met Gly Thr Phe 
50 55 60 

Leu Gly Ser Phe Pro Asp Tyr Val Arg Arg Val Ser Arg He Val Gly 
65 70 75 80 

Leu Thr Phe Phe Ala Gly Val Ser Leu Leu Pro Ala He Ser He Glu 



Lye Arg Leu Ser Ala Gly Val Cys Ala Leu Leu Trp Leu Leu Ser Phe 
115 120 125 

Leu Val Thr Ser He His Asn Tyr Phe Cys Leu Leu Gly His Glu Ala 
130 135 140 

Ser Gly Thr Ala Cys Leu Asn Met Asp He Ser Leu Leu Gly He Leu 
1*5 150 155 ISO 

Leu Phe Phe Leu Phe Cys Pro He Met Val Leu Pro Cys He Ala Leu 
155 170 175 

Leu His Val Glu Cys Arg Ala Arg Arg Arg Gin Arg Ser Al^ Lys Leu 
180 185 * 190 

Asn His Val Val Leu Ala He Val Ser Val Phe Leu Val Ser Ser He 
195 200 205 

Tyr Leu Gly He Asp Trp Phe Leu Phe Trp Val Phe Gin He Pro Ala 
210 215 220 

Pro Phe Pro Glu Tyr Val Arg Asp Leu Cys He Cys He Asn Ser Ser 
225 230 235 240 

Ala Lys Pro He Val Tyr Phe He Ala Gly Arg Asp Lys Ser Gin Arg 
245 250 255 

Leu Trp Glu Pro Leu Arg Val Val Phe Gin Arg Ala Leu Arg Asp Gly 
260 265 270 

Ala Glu Pro Gly Asp Ala Ala Ser Ser Thr Pro Asn Thr Val Thr Met 
275 280 285 



(2) INFORMATION FOR SBQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 299 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Thr Thr Glu Ala Val Leu Asn Thr Phe He He Phe Val Gly Gly Pro 

15 10 is 

Ala He Val Leu He Thr Gin Leu Leu Thr Asn Arg Val Leu Gly Tyr 
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20 25 30 

Ser Thr Pre Thr lie Tyr Met Arg Asn Leu Tyr Ser Thr Asn Phe Leu 
35 40 45 

Thr Leu Thr Val Leu Pro Phe lie Val Leu Ser Asn Gin Trp Leu Leu 
50 55 60 

Pro Ala Cys Tyr Val Ala Ser Cys Lys Phe Leu Ser Val He Tyr Tyr 
65 70 75 80 

Ser Ser Cys Thr Val Gly Phe Ala Thr Val Ala Leu lie Ala Ala Asp 
85 90 95 



(2) INFORMATION FOH SEQ ID NO: 79: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2S3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
!D) TOPOLOGY: linear 

(ii) molecule TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Leu Gly Val Trp Leu Met He Val Gly Thr Phe Leu Leu Val He Thr 
15 10 15 

Thr He Leu Tyr Tyr Arg Arg Lys Lys Lys Ser Pro Ser Asri Thr Tyr 
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He Cys Asn Leu Ala Val Ala Asp Leu Leu He Val Val Gly Leu Pro 
35 40 45 

Phe Phe Leu Glu Tyr Ala Lys His His Pro Lys Leu Ser Arg Glu Val 
50 55 60 

Val Cys Ser Gly Leu Asn Ala Cys Phe Tyr He CyB Leu Phe Ala Gly 

G5 70 75 80 

Val Cys Phe Leu He Asn Leu Ser Met Asp Arg Tyr Cys Val He Val 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 79: 
(i> SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 295 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Pro Val Thr Leu Phe Leu Tyr Gly Val Val Phe Leu Phe Gly Ser He 
15 10 15 

Gly Asn PhB Leu Val He Phe Thr He Thr Trp Arg Arg Arg He Gin 
20 25 30 

Cys Ser Gly Asp Val Tyr Phe He Asn Leu Ala Ala Ala Asp Leu Leu 
35 40 45 

Phe Val Cys Thr Leu Pro Leu Trp Met Gin Tyr Leu Leu Asc His Aen 
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Ser Leu Ala Ser Leu ILe Pro Cys Thr Leu Leu Thr Ala Cys Phe Tyr 
65 7D 75 80 

Val Ala He Thr Ala Ser Leu Cys Phe He Thr Glu He Ala Leu He 
85 90 95 



Gin Ala Cys Leu Phe Ser He Phe Trp Trp He Phe Ala Val He He 
115 120 125 



Val Glu Leu Met Leu Gly Ala Phe Val He Pro Leu Ser Val He Ser 
165 170 175 

Tyr Cys Tyr Tyr Arg He Ser Arg He Val Ala Val Ser Gin Ser Arg 
180 185 190 

His Lys Gly Arg He Val Arg Val Leu He Ala Trp Leu Val Phe He 
195 200 205 



Lys Leu Leu Lys Trp He Ser Ser Ser Cys Glu Phe Glu Arg Ser Leu 
225 230 235 240 

Lys Arg Ala Leu He Leu Thr Glu Ser Leu Ala Phe Cya His Cys Cys 
245 250 255 

Leu Asn Pro Leu Leu Tyr Val Phe Val He Gly Thr Lys Phe Arg Lys 
2S0 265 270 



(2) INFORMATION FOR SEQ ID NO: SO: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE ! peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Asp Asp Asp Asp Asn He Trp Ser He Phe Asp Trp He Gly Tyr Leu 
1 5 10 15 

Asn Ser He Ser Met Val He Tyr Thr Leu Phe Lys Lys Lys Lys 
20 25 30 

(2> INFORMATION FOR SEQ ID NO: 81: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 81: 

Asp Asp Asp Asp Asn lie Trp Asn He Phe Ser Thr He Gly Tyr Leu 
5 1 5 10 15 

Asn Ser He Ser Pro Val Ser Val He Met His He Tyr Gly Lys Lys 

20 25 30 

Lys Lys 



[2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Asp Asp Asp Asp Gly Tyr Ser He Tyr Asp Thr Leu Val Thr Phe Ala 
15 10 15 

He Asn Pro Val Tyr He Thr Val Phe Lys Lys Lys Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Asp Asp Asp Asp Asn Ala Trp Ser Ala Phe Asp Trp Ala Leu Tyr Leu 
15 10 15 

Asn Ser He Ser Met Ala He Tyr Thr Tyr Ala Lys Lys Lys Lys 
20 25 30 

(2> INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Leu Phe Ser Phe He Thr Trp Leu Gly Tyr Ala Asn ser Ser Leu Asn 
15 10 15 

Pro He He Tyr Thr Thr Phe 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Tyr Thr He Tyr Ser Ser Ser Val Val Phe Phe Ala Pro Ser Leu Ala 
15 10 15 



He Met val He Thr Tyr Thr 

20 



WO 94/05695 



PCT/US93/08528 



- 13S - 

(2) INFORMATION FOR SEQ ID NO: 86: 
(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

He Trp Leu Thr Ser Asp He Met Ser Thr Ser Ser He Leu His Asn 
15 10 15 

Leu Cys Val He Ser Phe 
20 

(2) INFORMATION FOR SEQ ID N0:S7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

He Trp Ser He Phe Ser Ser Asp He val Val Gly Tyr Ala Asn His 
15 10 15 

Ser Ser Leu Ala He Met Cys Pro He Val He Tyr Thr Va. 

20 25 30 

<2) INFORMATION FOR SEQ ID NO: 86: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

He Phe Thr He Phe Ser Ser Asp He Ala Val Gly Tyr Ala Asn His 
15 10 15 

Ser Ser Ala Ala He Met Pro He Val He Tyr Ser Val 
20 25 

(2) INFORMATION FOR SEQ ID NO: 89: 
(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Lye Asn Ala Ser Ala Leu Leu Ser Val He He He ABn Ser He Gly 
15 10 15 

Gly Asn Val Val Thr Ala Val Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 90: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Tyr Phe Leu Met Ser Leu Ala Val Thr Asp Leu Val Val Ser Phe Val 

15 10 15 
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Met Pro Val Ser Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 91: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 

Ala lie Thr Lye lie Ala lie Thr Trp Ala lie Ser Gly Val Ser Val 
1 5 10 15 

Pro Phe lie Pro val Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Leu Gly He He Phe Gly Thr Phe He He He Trp Leu Pr- Phe Phe 
15 10 15 

He Thr Asm Leu Val Ser Pro He 
20 

(2) INFORMATION FOR SEQ ID NO:93: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

He Trp He Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ser He Met 
15 10 15 

His Leu Cys Ala He Ser Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 94: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Gly Tyr Thr He Tyr Ser Thr Leu Val Thr Phe Tyr He Pro Ser Val 
IF 10 15 

He Met Val He Thr Tyr Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
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Leu Leu Asn Phe Phe Asn Trp lie Gly Tysr Leu Asn Ser Leu lie Asn 
15 10 15 

Pro Val He Tyr Thr Leu Phe 
20 
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WHAT IS CLAIMED IS: 

1. A G-protein coupled receptor polypeptide, consisting 
essentially of an amino acid sequence of 15 to 40 amino acids 
substantially corresponding to a fragment or consensus peptide of a 

5 transmembrane domain of a G-protein coupled receptor, wherein said 
polypeptide has a GPR- related biological activity selected from 
binding a GPR ligand or modulating GPR ligand binding to a GPR. 

2. A polypeptide according to claim l, wherein said 
polypeptide is selected from a synthetic polypeptide, a recombinant 

10 polypeptide or a purified polypeptide. 

3 . A polypeptide according to claim 1 , wherein said G- 
protein coupled receptor is a receptor selected from a cAMP receptor, 
an adenosine receptor, a j8- adrenergic receptor, a muscarinic 
acetylcholine receptor, an a-adrenergic receptor, a serotonin 

15 receptor, a histamine H2 receptor, a thrombin receptor, a kinin 
receptor, a follicle stimulating hormone receptor, an opsin, a 
rhodopsin, an odorant receptor, a cytomegalovirus receptor, or a jnas 
oncogene GPR. 

4. A polypeptide according to claim 1, wherein said 
20 transmembrane domain is selected from at least one of transmembrane 

domain TM1, TM2 , TM3, TM4 , TM5 , TM6 or TM7. 

5. A polypeptide according to claim 3, wherein said 
transmembrane . domain is a D, receptor transmembrane segment III or 
segment V. 

25 6 . A polypeptide according to claim 4, wherein said 

polypeptide has the amino acid sequence of Pig. 2 (SEQ ID NO: 2 ). 

7. A polypeptide according to claim 4, wherein said 
polypeptide has the amino acid sequence of Fig. 3 {SEQ ID NO: 3) . 

8. A polypeptide according to claim 4, wherein said 
30 polypeptide has an amino acid sequence selected from one of SEQ ID 

NOS:80-95. 

9. A polypeptide according to claim 4, wherein said 
polypeptide has an amino acid sequence of one of SEQ ID NOS:96-348. 

10 . A polypeptide according to claim 9 , wherein said 
35 polypeptide has an amino acid sequence from one of SEQ ID NOS:96-225. 
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11. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID N0S:226- 
289. 

12. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:290- 
297. 

13. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:298- 
324. 

14. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:325- 
338. 

15. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:339- 
348. 

16. A polypeptide according to claim 3, wherein said 
transmembrane domain is a dopamine receptor transmembrane domain 
selected from the group consisting of a D,, D 2 , Dj, D 4 or D 3 
transmembrane domain. 

17. A composition comprising a polypeptide according to 
claim 1, or a pharmaceutically acceptable ester, ether, sulfate, 
carbonate, glucuronide or salt thereof, and a pharmaceutically 
acceptable carrier. 

18. A composition according to claim 16, wherein said 
transmembrane domain is D 2 receptor transmembrane segment III or 
segment V. 

19. A composition according to claim 18, further 
comprising a drug selected from a phenothiazine derivative, a 
thioxanthine derivative, a butyrophenone derivative, a 
dihydroindolone, a dibenzoxazepine derivative and an atypical 
neuroleptic. 

20. A method for treating a subject suffering from a 
pathology related to an abnormality of a G-protein coupled receptor, ' 
comprising administering to said subject a therapeutically effective 
amount of composition according to claim 16. 

21. The method of claim 20, wherein said pathology is a 
psychotic disorder. 
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22. The method of claim 21, wherein said psychotic 
disorder is a schizophrenia. 

23. The method of claim 20, wherein said composition is 
administered to provide said polypeptide, fragment or consensus 

5 peptide thereof, in an amount ranging from about 0.01 jtg tc 100 mg/kg 
per day. 

24. The method of claim 23, wherein said composition is 
administered to provide said polypeptide, fragment or consensus 
peptide thereof, in an amount ranging from about 10 fig to 10 mg/kg per 

10 day. 

25. The method of claim 20, wherein said administering is 
by oral, mucosal, intravenous, intramuscular or parenteral 
administration. 

26. A method for producing a polypeptide according to 
15 claim 1, wherein said polypeptide is a recombinant polypeptide 

obtained from a recombinant host which expresses a heterologous 
nucleic acid encoding said polypeptide, comprising the steps of: 

{A) providing a host comprising a recombinant nucleic 
acid encoding a polypeptide according to claim 1 in 
20 expressible form; 

(B) culturing said host under conditions such that 
said polypeptide is expressed in recoverable amounts; and 

(C) recovering said polypeptide produced by said 

host. 

25 27. The method of claim 26, further comprising: 

{D) purifying said polypeptide. 

28. The method of claim 26, wherein said host is a 
bacteria or a eukaryotic cell. 

29. The method of claim 28, wherein said eukaryotic cell 
30 is a mammalian cell, an insect cell or a yeast cell. 

30. A method for producing a polypeptide according to 
claim 1, comprising: 

(A) chemically synthesizing a polypeptide according 
to claim l in recoverable amounts; and 
35 (B) recovering said polypeptide. 



WO 94/05695 



PCT/US93/08528 



- 142 - 

31. A method for isolating a G-protein coupled receptor, 
fragment or consensus sequence thereof, or a protein that binds the 
G-protein coupled receptor, comprising 

(A) providing a bound support, said support being 
bound to a polypeptide according to claim 1, or an 
antibody, anti- idiotype antibody, or a fragment thereof; 

(B) contacting a sample containing said G-protein 
coupled receptor or said protein that binds a G-protein 
coupled receptor to said bound support, such that said 
receptor or protein is reversibly bound to said bound 
support; and 

(C) recovering said receptor or protein that is 
attached to the bound support by dissociating the receptor 
or protein under conditions that cause elution or 
dissociation of the receptor or protein from said bound 
support . 

32. A method according to claim 31, wherein said GPR is 
a dopamine receptor. 

33. An antibody, anti- idiotype antibody or a fragment of 
said antibody or anti -idiotype antibody, that specifically displays 
an epitope of a G-protein coupled receptor polypeptide, according to 
claim 1. 

34. A recombinant nucleic acid comprising a nucleotide 
sequence encoding a G-protein coupled receptor polypeptide according 
to claim 1. 

35. A vector comprising a nucleic acid according to claim 

34. 

36. A host cell comprising the nucleic acid of claim 34. 

37. A host cell according to claim 36, wherein said host 
cell is selected from a mammalian cell, a yeast cell, a bird cell or 
an insect cell. 

38. A host cell according to claim 36, wherein, when said 
nucleic acid is expressed as said receptor polypeptide in said host 
cell, a receptor binding molecule comprising said env binding domain 
binds to said receptor polypeptide. 
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39. A host cell according to claim 37, wherein said host 
cell is a mammalian cell selected from a human cell, a primate cell 
or a rodent cell. 

40. A method for isolating a protein that binds a 
G-protein coupled receptor, comprising 

(A) providing a bound support, said support being 
bound to a polypeptide according to claim 1, or 
anti-idiotype antibody thereto; 

(B) contacting a sample containing said protein that 
binds a G-protein coupled receptor to said bound support, 
such that said protein is reversibly bound to said bound 
support ; and 

(C) recovering said protein that is attached to the 
bound support by dissociating the receptor or protein under 
conditions that cause elution or dissociation of the 
protein from said bound support. 

41. A method according to claim 40, wherein said GPR is 
a dopamine receptor. 
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